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Non-cytotoxic Protein Conjugates 

This Invention relates to a non-cytotoxic protein conjugate, and to the use of said 
s conjugate for treating pain. 

Toxins may be generally divided into two groups according to the type of effect that 
they have on a target cell. In more detail, the first group of toxins l^ill their natural 
target cells, and are therefore known as cytotoxic toxin molecules. This group of 
10 toxins is exemplified inter alia by plant toxins such as ricin, and abrin, and by 
bacterial toxins such as diphtheria toxin, and Pseudomonas exotoxin A. Cytotoxic 
toxins typically kill their target cells by inhibiting the cellular process of protein 
synthesis. 

15 In contrast, the second group of toxins, which are known as non-cytotoxic toxins, do 
not (as their name confirms) kill their natural target ceils. Non-cytotoxic toxins have 
attracted much less commercial interest than have their cytotoxic counterparts, and 
exert their effects on a target cell by inhibiting cellular processes other than protein 
synthesis. As with their cytotoxic counterparts, non-cytotoxic toxins are produced 

20 from a variety of sources such as plants, and bacteria. Bacterial non-cytotoxic toxins 
are now described In more detail. 

Clostridial neurotoxins are proteins that typically have a molecular mass of the order 
of 150 kDa. They are produced by various species of bacteria, especially of the 
25 genus Clostridium, most importantly C. tetani and several strains of C. botulinum, C, 
butyricum and C. argentihense. There are at present eight different classes of the 
clostridial neurotoxin, namely: tetanus toxin, and botulinum neurotoxin in its 
serotypes A, B, Ci, D, E, FandG, and theyall share similar structures and modes of 
action. 

30 

Clostridial neurotoxins represent a major group of non-cytotoxic toxin molecules, and 
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are synthesised by the host bacterium as single polypeptides that are modified post- 
translationally by a proteolytic cleavage event to form two polypeptide chains joined 
together by a disulphide bond. The two chains are ternied the heavy chain (H- 
chain), which has a molecular mass of approximately 100 kDa, and the light chain 
5 (L-chain), which has a molecular mass of approximately 50 kDa. 

L-chains possess a protease function (zinc-dependent endopeptidase activity) and 
exhibit high substrate specificity for vesicle and/or plasma membrane associated 
proteins involved in the exocytic process. L-chains from different clostridial species 
10 or serotypes may hydrolyse different but specific peptide bonds In one of three 
substrate proteins, namely synaptobrevin, syntaxin or SNAP-25. These substrates 
are important components of the neurosecretory machinery. 

Non-cytotoxic toxins are also produced by other bacteria, such as from the genus 
15 Neisseria, most importantly from the species N. gonorrhoeae. For example, 
Neisseria sp. produces the non-cytotoxic toxin IgA protease (see W099/58571). 

It has been well documented in the art that toxin molecules may be re-targeted to a 
cell that is not the toxin's natural target cell. When so re-targeted, the modified toxin 

20 is capable of binding to a desired target ceil and, following subsequent translocation 
into the cytosol, is capable of exerting its effect on the target cell. Said re-targeting 
is achieved by replacing the natural Targeting Moiety (TM) of the toxin with a 
different TM. In this regard, the TM is selected so that it will bind to a desired target 
cell, and allow subsequent passage of the modified toxin into an endosome within 

25 the target cell. The modified toxin also comprises a translocation domain to enable 
entry of the non-cytotoxic protease into the cell cytosol. The translocation domain 
can be the natural translocation domain of the toxin or It can be a different 
. translocation domain obtained from a microbial protein with translocation activity. 

30 For example, in the context of non-cytotoxic toxin molecules, it has been well 
documented that a clostridial neurotoxin may be re-targeted by incorporation of a 
Targeting Moiety (TM), which is not the natural TM of a clostridial neurotoxin. The 
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described chemical conjugation and recombinant methodologies are now regarded 
as conventional, and reference is made to Hermanson. G.T. (1996), Bioconjugate 
techniques, Academic Press, and to Wong, S.S. (1991), Chemistry of protein 
conjugatien-and cross-linking, CRC Press. 

5 

For example. WO94/21300 describes modified clostridial neurotoxin molecules that 
are capable of regulating Integral Membrane Protein (IMP) density present at the cell 
surface of the target cell. The modified neurotoxin molecules are thus capable of 
controlling ceil activity (e.g. glucose uptake) of the target cell. W096/33273 and 

10 WO99/17806 describe modified clostridial neurotoxin molecules that target 
peripheral sensory afferents. The modified neurotoxin molecules are thus capable 
of demonstrating an analgesic effect. WOOO/10598 describes the preparation of 
modified clostridial neunDtoxin molecules that target mucus hypersecreting cells (or 
neuronal cells controlling said mucus hypersecreting cells), which modified 

15 neurotoxins are capable of Inhibiting hypersecretion from said cells. WO01/21 21 3 
describes modified clostridial neurotoxin molecules that target a wide range of 
different types of non-neuronal target cells. The modified molecules are thus 
capable of preventing secretion from the target cells. Additional publications in the 
technical field of re-targeted toxin molecules include: WOOO/62814; WOOO/04926; 

20 US5.773,586:. W093/1 5766; WOOO/61 1 92; and W099/58571 . 

Thus, from the above-described publications, it will be appreciated that the basic 
concept of re-targeting a non-cytotoxic protease to a desired target cell, by selecting 
a TM that has a corresponding receptor present on the target cell, has been well 
25 documented. 

However, different receptors present on a target cell of interest demonstrate different 
binding affinities for different TMs. This may be a particular problem with pain- 
sensing cells, which possess.a wide range of receptor types having different binding 
30 affinities for different TMs. Thus, a re-targeted conjugate comprising a particular TM 
(that binds to a receptor on a pain-sensing cell) may demonstrate a low binding 
... afflnity for.a painrserising.iarg^t cell^ which is undesirable. 



wo 2006/059105 



PCT/GB2005/004598 



4 

There is therefore a need to develop modified non-cytotoxic conjugates that address 
one or more of the above problems. Of particular interest is the development of an 
Hmproved conjugate-foruse in tr«ating-pain^. 

5 

The present invention seeks to address one or more of the above problems by using 
as the conjugate's Targeting Moiety (TM) an "agonisf of a receptor that is present 
on the pain-sensing target cell of interest. In preferred embodiments, the pain- 
sensing target cell is a nociceptive sensory afferent, more preferably a primary 
10 nociceptive sensory afferent. In particularly preferred embodiments, the TM is an 
agonist of the opiold-like receptor-1 (ORLi) receptor. 

Accordingly, in a first aspect, the present invention provides a non-cytotoxic 
conjugate for inhibition or reduction of exocytic fusion in a nociceptive sensory 
15 afferent cell, comprising: 

(i) a Targeting Moiety (TM), 

wherein said TM is an agonist of a receptor present on said 
20 nociceptive sensory afferent cell, and wherein said receptor 

undergoes @ndocytosis to be incorporated into an endosome 
within the nociceptive sensory afferent cell; . 

(ii) a non-cytotoxic protease or a fragment thereof, 

25 . 

wherein the protease or protease fragment is capable of 
cleaving a protein of the exocytic fusion apparatus of said 
nociceptive sensory afferent c^ll; and 

30 (iii) a Translocation Domain, 

-wherein the Translocation Domain translocates the protease or 
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protease fragment from within the endosome, across the 
endosomal membrane, and into the cytosol of the nociceptive 
sensory afferent cell. 

5 The use of an "agonist", which would nonnally stimulate a biological process, 
particularly exocytosis (for example, an increase in cellular secretion, or an up- 
regulation in membrane protein expression), is an exciting development in the 
technical field of re-targeted toxins. Furthermore, it is particularly surprising that an 
agonist may be employed in a therapeutic composition to achieve a reduction or 

10 inhibition of a biological process that the agonist would nomnally stimulate. 

The agonist-containing conjugates of the present invention represent a distinct sub- 
set of toxin conjugates. In more detail, the conjugates of the present invention 
comprise TMs that have been selected on the basis of specific agonist properties 
15 rather than on the simple basis that they have a conresponding receptor on a pain- 
sensing target cell of interest. 

Conventionally, an agonist has been considered any molecule that can either 
increase or decrease activities within a cell, namely any molecule that simply causes 
20 an alteration of cell activity. For example, the conventional meaning of an agonist 
would include: a chemical substance capable of combining with a receptor on a cell 
and initiating a reaction or activity, or a drug that induces an active response by 
activating receptors, whether the response is an Increase or decrease in cellular 
activity. 

25 

However, for the purposes of this invention, an agonist is more specifically defined 
as a molecule that Is capable of sfimulating the process of exocytic fusion in a pain- 
sensing target cell, which process is susceptible to Inhibition by a protease (or 
fragment thereof) capable of cleaving a protein of the exocytic fusion apparatus in 
30. said target cell. 

. ' - Tfi'-AcGOfSingly, the-pe^^ - ^'■■j-i^-^^, 
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many molecules that would be conventionally considered as agonists. For example, 
nerve growth factor (NGF) is an agonist in respect of its ability to promote neuronal 
differentiation via binding to a TrkA receptor. However, NGF is not an agonist when 
--assessed by the- above criteria -because it Is not a principal Inducer of exocytic 
5 fusion. In addition, the process that NGF stimulates (i.e. cell differentiation) Is not 
susceptible to inhibition by the protease activity of a non-cytotoxic toxin molecule. 



In use, an agonist-containing conjugate of the present invention does not deactivate 
an agonist receptor on a pain-sensing target cell, but rather the protease activity of 
10 the conjugate serves to negate the agonist-mediated response. 



Furthemnore, once delivered to the cytosol of the paln-sensing target cell, the 
protease component of a conjugate of the present invention Inhibits or blocks the 
action of all subsequent agonists capable of causing the same effect (I.e. Increased 

15 exocytic fusion) in the same target cell. This Is advantageous and means that the 
conjugates of the present invention have application in situations where multiple 
agonists may be responsible for causing the sensation of pain. Thus, when 
designing a conjugate of the present invention, the TM that is selected for delivery 
need not necessarily be the principal agonist involved In causing the sensation of 

20 pain. 

Agonist-mediated delivery according to the present invention provides the following 
significant advantage over previous non-cytotoxic protease-containing therapeutics: 
use of an agonist may confer preferential binding and/or internalisation properties on 
25 the conjugate. This, in turn, may result in more efficient delivery of the protease 
component to a pain-sensing target cell. 

In addition, use of an agonist as a TM Is self-limiting with respect to side-effects. In 
more detail, binding of an agonist to a pain-sensing target cell increases exocytic 
30 fusion , which may exacerbate the sensation of pain. However, the exocytic process 
that is stimulated by agonist binding is subsequently reduced or inhibited by the 
proteasecomponentof the conjugate. ^" . . . - , , 
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In preferred embodiments of the invention, the TM is an agonist of the ORLi 
receptor. The ORLi receptor is present on pain-sensing cells in the body. 

5 The ORLi receptor is a member of the G-protein-coupled class of receptors, and has 
a seven transmembrane domain structure. The properties of the ORLi receptor are 
discussed in detail in Mogil & Pasternak (2001), Pharmacological Reviews, Vol. 53, 
No. 3, pages 381-415. 

10 Throughout this specification, reference to the "ORLi receptor" embraces all 
members of the ORLi receptor family. Members of the ORLi receptor family typically 
have a seven transmembrane domain structure, and are coupled to G-proteins of 
the Gi and Go families. A method for detemiining the G-protein-stimulating activity of 
llgands of the ORLi receptor is given In Example 17. A method for measuring 

15 reduction in cellular cAMPtevels foltowing ORLfactivation is given in Example 1 6. 
A further characteristic of members of the ORLi receptor family is that they are 
typically able to bind nociceptin (the natural ligand of ORLi). As an example, all 
altemative splice variants of the ORLi receptor, are members of the ORLi receptor 
family. 

20 

The conjugates of the present invention generally demonstrate a reduced binding 
affinity (In the region of up to 100-fold) for nociceptive sensory afferent target cells 
when compared with the conresponding "free" TM. However, despite this 
observation, the conjugates of the present invention surprisingly demonstrate good 
25 efficacy. This can be attributed to two principal features. First, the non-cytotoxic 
protease component Is catalytic - thus, the therapeutic effect of a- few such 
molecules Is rapidly amplified. Secondly, the receptors present on the nociceptive 
. sensory afferents need only act as a gateway for entry of the therapeutic, and need 
not necessarily be stimulated to a level required in order to achieve a ligand-receptor 
30 mediated phannacological response. Accordingly, the conjugates of the present 
Invention may be administered at a dosage that is much lower that would be 
- — - ;-0mpl©yed for other types of. analgesic molecules «uch as NSAIDS, morphine, and 
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gabapentin. The latter molecules are typically administered at high microgram to 
milligram (even up to hundreds of milligram) quantities, whereas the conjugates of 
the present invention may be administered at much lower dosages, typically at least 
40-fold-lower,-and-more typically-at tOG-fold lower. 

5 

In a particularly preferred embodiment of the invention, the TM of the conjugate is 
noclceptin - the natural ligand for the ORL1 receptor. Nociceptin targets the ORL1 
receptor with high affinity. 

10 Examples of other preferred TMs include: 



Code 


Sequence 


Ref. 


SEQ ID No. 


Nociceptin 1-17 


FGGFTGARKSARKLANQ 


[1] 


1,2 


NociCisptin 1-1t " 


FGGFTGARKSA ' 


[1] 


3,4 


Nociceptin [YIOJI-H 


FGGFTGARKYA 


[1] 


5,6 


Nociceptin [Ylljl-H 


FGGFTGARKSY 


[1] 


7.8 


Nociceptin IY14]1-17 


FGGFTGARKSARKYANQ 


[1] 


9,10 


Nociceptin 1-13 


FGGFTGARKSARK 


[2] 


11,12 


Nociceptin [R14K15] 
1-17 (also known as 
"variant" nociceptin) 


FGGFTGARKSARKRKNQ 


[3.41 


13,14 


Nociceptin I-I3-NH2 


FGGFTGARKSARK-NH2 


[5] 




Nociceptin 

Phe (P-NO2) 1-17 


(pN02)FGGFTGARKSARKLANQ 


[5] 




Lofentanll 


Non-peptide agonists 


[5] 


m 
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Code 


Sequence 


Ref. 


SEQ ID No. 


Etorphine 


Non-peptide agonists 


[5] 




Peptide agonist 


Peptide agonists from combinatorial 
library approach 


[6] 





[1] Mogil & Pasternak, 2001, Phamiacol. Rev., 53, 381-415 
[2] Maileet a!., 2003, Neurosci. Lett., 350, 190-192 
[3] Rizzi at al., 2002, J. Pharmacol. Exp. Therap., 300, 57-63 
5 [4] Okada et al., 2000. Biochem. Biophys. Res. Commun., 278, 493-498 
[5] Zaveri, 2003, Life Sci., 73, 663-678. 
[61 Dooley et al., 1997, J Phamiacol Exp Ther. 283(2), 735-41 . 

The TM preferably comprises a maximum of 50 amino acid residues, more 
10 preferably a maximum of 40 amino acid residues, particularly preferably a maximum 
(5f 30 amino acid residues; ^nd most preferabty "a" maximum of 20 amino acid 
residues. For example, nociceptin is a 17 amino acid residue peptide. 

The above-identified "varianf TM demonstrates particularly good binding affinity 
15 (when compared with natural nociceptin) for nociceptive sensory afferents. Generally 
speaking, a TM-containing conjugate will demonstrate an approximate 100-fold 
reduction In binding ability vis-a-vis the TM perse. The above-mentioned "variant" 
TM perse demonstrates an approximate 3- to 1 0-fold increase in binding ability for a 
nociceptive sensory afferent vis-a-vis natural nociceptin. Thus, a "variant" TM- 
20 containing fusion might be expected to demonstrate an approximate 10-fold 
reduction in binding ability for a nociceptive sensory afferent vis-ii-vis 'free' 
nociceptin. However, the present inventors have demonstrated that conjugates 
comprising said "variant" TM demonstrate a binding ability that (most surprisingly) 
closely minrors that of Iree' nociceptin - see Figure 1 7. 

25 

In the context of the present invention, the terni agonist of the ORLi receptor (such 
as nociceptin, or any one of the peptides listed in the table above) embraces 
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molecules having at least 70%, preferably at least 80%, more preferably at least 
90%, and most preferably at least 95% homology with said agonist. The agonist 
homologues retain the agonist properties of nociceptin at the ORLi receptor, which 
- may-be-tested using the-methods provided in Example 10. 

5 

The invention also encompasses fragments, variants, and derivatives of any one of 
the TMs described above. These fragments, variante, and derivatives will 
substantially retain the properties that are ascribed to said TMs. 

10 The agonist properties of a TIVI can be confirmed using the methods described in 
Example 1 . These methods are based on previous experiments (see Inoue et al. 
(1998) Proc. Natl. Acad. Sci., 95, 10949-10953), which confirm that the natural 
agonist of the ORLi receptor, nociceptin, causes the induction of substance P 
release from nociceptive primary afferent neurons. This is supported by the facts 

15 that: 



> the nociceptin-induced responses are abolished by specific NK1 
receptor (the substance P receptor) antagonists; and 

20 > pre-treatment of the cells with capsaicin (which depletes substance P 
from small diameter primary afferent neurons) attenuates the 
nociceptin-induced responses. 

Similarly, Inoue et al. confirm that an intraplantar injection of botulinum neurotoxin 
25 type A abolishes the nociceptin-induced responses. Since it is known that BoNT 
inhibits the release of substance P from primary afferent neurons (Welch et al., 
(2000), Toxicon, 38, 245-258), this confinms the link between nociceptin-ORLi 
. interaction and subsequent release of substance P. 

30 Thus, a TM can be said to have agonist activity at the ORLi receptor If the TM 
causes an induction in the release of substance P from a nociceptive sensory 
■ ■ afferentneuron (see Example 1). 
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In another embodiment, opioids represent a preferred group of TMs of the present 
invention. Within this family of peptides is included enkephalins (met and leu). 
endomoTJhins 1 and 2, p-endorphin and dynorphin. Opioid peptides are frequently 

5 used in the clinic to modify the activity to nociceptors, and other cells involved in the 
pain response. As exemplified by the three-step World Health Organisation 
Analgesic Ladder, opioids have entry points into the phamiacological treatment of 
chronic cancer and non-cancer pain at all three stages, underlining their importance 
to the treatment of pain. Reference to opioids embraces fragments, variants and 

10 derivatives thereof, which retain the ability to bind to nociceptive sensory afferents. 
The protease of the present invention embraces ail naturally-occum'ng non-cytotoxlc 
proteases that are capable of cleaving one or more proteins of the exocytic fusion 
apparatus in eukaryotic cells. 

IS The protease of the present invention is preferably a bacterial protease. 

More preferably, the bacterial protease is selected from the genera Clostridium or 
Neisseria (e.g. a clostridial L-chain, or a neisserial IgA protease preferably from N. 
gonorrhoeae). 

20 

The present Invention also embraces modified non-cytotoxlc proteases, which 
Include amino acid sequences that do not occur in nature and/or synthetic amino 
acid residues, so long as the modified proteases still demonstrate the above- 
mentioned protease activity. 

25 

The protease of the present invention preferably demonstrates a serine or 
metalloprotease activity (e.g. endopeptidase activity). The protease is preferably 
. specific for a SNARE protein (e.g. SNAP-25, synaptobrevinA/AMP, or syntaxin). 

30 Particular mention is made to the protease domains of neurotoxins, for example the 
protease domains of bacterial neurotoxins. Thus, the present invention embraces 
-■^ ^he use of neurotoxin domains, which occur in nature, as well as recombinantly 
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prepared versions of said naturally-occurring neurotoxins. 

Exemplary neurotoxins are produced by Clostridia, and the ternn clostridial neurotoxin 
embraces neurotoxins produced by -G~tetani (TeNT), and by C. botulinum (BoNT) 
5 serotypes A-G, as well as the closely related BoNT-like neurotoxins produced by C. 
bamtii and C. butyricum. The above-mentioned abbreviations are used throughout 
the present specification. For example, the nomenclature BoNT/A denotes the 
source of neurotoxin as BoNT (serotype A). Conresponding nomenclature applies to 
other BoNT serotypes. 

10 

The temn L-chain fragment means a component of the L-chain of a neurotoxin , which 
fragment demonstrates a metalloprotease activity and is capable of proteolytically 
cleaving a vesicle and/or plasma membrane associated protein Involved in cellular 
exocytosis. 

15 

A Translocation Domain is a molecule that enables translocation of a protease (or 
fragment thereof) into a pain-sensing target cell such that a functional expression of 
protease activity occurs within the cytosol of the target cell. Whether any molecule 
(e.g. a protein or peptide) possesses the requisite translocation function of the 
20 present Invention, may be confinmed by any one of a number of conventional assays. 

For example, Shone C. (1987) describes an in vitro assay employing liposomes, 
which are challenged with a test molecule. Presence of the requisite translocation 
function is confirmed by release from the liposomes of K* and/or labelled NAD, 
25 which may be readily monitored (see Shone C. (1 987) Eur. J. Biochem; vol. 1 67(1 ): 
pp. 175-180). 

A further example is provided by Blaustein R. (1987). which describes a simple in 
vitro assay employing planar phospholipid bilayer membranes. The membranes are 
30 challenged with a test molecule and the requisite translocation function is confinned 
by an increase in conductance across said membranes (see Blaustein (1 987) FEBS 
^ ■ - - . - ' Letts; vol. 226, no. 1: pp. 11.5r120)» -o - - ..-^ - ^ . 
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Additional methodology to enable assessment of membrane fusion and thus 
identification of Translocation Domains suitable for use in the present invention are 
■proy\6e^~by Methods -in-Enzymologyr^fo\s.^ '220 and -221. Membrane Fusion 
5 Techniques, Parts A and B, Academic Press 1 993. 

The Translocation Domain Is preferably capable of formation of lon-pemneable pores 
in lipid membranes under conditions of low pH. Preferably, it has been found to use 
only those portions of the protein molecule capable of pore-fomiation within the 
10 endosomal membrane. 

The Translocation Domain may be obtained from a microbial protein source, in 
particular from a bacterial or viral protein source. Hence, in one embodiment, the 
Translocation Domain is a translocating domain of an enzyme, such as a bacterial 
15 toxin or A/iral protein. • 

It is well documented that certain domains of bacterial toxin molecules are capable 
of fomriing such pores. It Is also known that certain translocation domains of virally 
expressed membrane fusion proteins are capable of forming such pores. Such 
20 domains may be employed In the present invention. 

The Translocation Domain may be of a clostridial origin, namely the Hn domain (or a 
functional component thereof). Hn means a portion or fragment of the H-chain of a 
clostridial neurotoxin approximately equivalent to the amino-tenninal half of the H- 
25 chain, or the domain con-esponding to that fragment in the intact H-chain. Examples 
of suitable clostridial Translocation Domains include: 

Botulinum type A neurotoxin - amino acid residues (449-871) 

Botulinum type B neurotoxin - amino acid residues (441 -858) 

30 Botulinum type C neurotoxin - amino acid residues (442-866) 

Botulinum type D neurotoxin - amino acid residues (446-862) 

. . .'.BotuHnuhf>.type E neurotoxin • '-^ - amino acid residues (423-845) 
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Botulinum type F neurotoxin 
Botulinum type G neurotoxin 
Tetanus neurotoxin 



amino acid residues (440-864) 
amino acid residues (442-863) 
amino acid residues (458-879) 



5 For further details on the genetic basis of toxin production in Clostridium botulinum 
and C. tetani, we refer to Henderson ef a/. (1997) in The Clostridia: Molecular 
Biology arid Pathogenesis. Academic press. 

The term Hn embraces naturally-occumng neurotoxin Hn portions, and modified Hn 
10 portions having amino acid sequences that do not occur in nature and/or synthetic 
amino acid residues, so long as the modified Hn portions still demonstrate the 
above-mentioned translocation function. 

Alternatively, the Translocation Domain may be of a non-clostridial origin (see 
15 table below). Exam pies of non-clostridial Translocation Domain origins include, but 
are not restricted to, the translocation domain of diphtheria toxin [O'Keefe ef a/., 
Proc. Natl. Acad. Sci. USA (1992) 89, 6202-6206; Silvennan ef a/., J. Biol. Chem. 
(1993) 269, 22524-22532; and London, E. (1992) Biochem. Biophys. Acta., 1112, 
pp.25-51], the translocation domain of Pseudomonas exotoxin type A [Prior et al. 
20 Biochemistry (1992) 31, 3555-3559], the translocation domains of anthrax toxin 
[Blanke ef al. Proc. Natl. Acad. Sci. USA (1996) 93, 8437-8442], a variety of 
fusogenic or hydrophobic peptides of translocating function [Plank et al. J. Biol. 
Chem. (1994) 269, 12918-12924; and Wagner et al (1992) PNAS, 89, pp.7934- 
7938], and amphiphilic peptides [Murata ef a/(1992) Biochem., 31, pp.1 986-1 992]. 
25 The Translocation Domain may mirror the Translocation Domain present in a 
naturally-occumng protein, or may include amino acid variations so long as the 
variations do not destroy the translocating ability of the Translocation Domain. 

Particular examples of viral Translocation Domains suitable for use In the present 
30 invention include certain translocating domains of virally expressed membrane 

fusion proteins. For example. Wagner ef al. (1992) and Murata ef al. (1992) 
1' r ' describe^thB tFShsTocation (i:e; membrane.fusion and vesiculation) function of a. 
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number of fusogenic and amphiphilic peptides derived from tlie N-terminal region of 
influenza virus haemagglutinin. Other virally expressed membrane fusion proteins 
l<nown to liave the desired translocating activity are a translocating domain of a 
fusogeniapeptide of Semlil«i-Fopest Virus(SFV); a translocating domain of vesicular 
stomatitis virus (VSV) glycoprotein G, a translocating domain of SER virus F protein 
and a translocating domain of Foamy virus envelope glycoprotein. Virally encoded 
"spike proteins" have particular application in the context of the present invention, 
for example, the E1 protein of SFV and the G protein of VSV. 

Use of the Translocation Domains (listed below) includes use of sequence variants 
thereof. A variant may comprise one or more conservative nucleic acid substitutions 
and/or nucleic acid deletions or insertions, with the proviso that the variant 
possesses the requisite translocating function. A variant may also comprise one or 
more amino acid substitutions and/or amino acid deletions or Insertions, so long as 
the variant possesses the requisite translocating function. 



Translocation 
Domain source 


Amino acid 
residues 


References 


Diphtheria toxin 


194-380 


Silvemnan ef a/., 1994. J. Biol. 
Chem. 269, 22524-22532 
London E., 1992, Biochem. 
Biophys. Acta., 1113, 25-51 


Domain II of 

pseudomonas 

exotoxin 


405^13 


Prior eta/., 1992, Biochemistry 
31,3555-3559 

Kihara & Pastan, 1994, Bioconj 
Chem. 5, 532-538 


Influenza virus 
haemagglutinin 


GLFGAIAGFIENGWE 
GMIDGWYG, and 
Variants thereof 


Plank ef a/., 1994, J. Biol. Chem. 

269,12918-12924 

Wagner ef a/., 1992, PNAS, 89, 

7934-7938 

Murata ef a/., 1992, Biochemistry 
31,1986-1992 
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"TrsinclACSition 
1 1 alloiuuaUVll 

Domain source 


Amino acid 
residues 


References 


Semliki Forest vims 
fusogenic protein 


Translocation aomain 


IVieilarl vial,, I99U| u v-»cii wiwi. 

134(4), 863-872 


Vesicular Stomatitis 
virus glycoprotein G 


118-139 


Yao et a/., 2003. Virology 310(2), 
319-332 


SER virus F protein 


1 ransiOCaiion uoinairi 


^i»th a/ 2003 J Virol 77^1 1 ^ 
6520-6527 


Foamy virus 

envelope 

glycoprotein 


Translocation domain 


Plcard-Maureau etal., 2003, J 
Virol. 77(8). 4722-4730 



Once a potential receptor agonist (e.g. an 0RL1 agonist) has been identified, one or 
more of the following optional steps may be carried out: 



(A) (xjnfinning that the putative agonist molecule or agonist is 
capable of being combined with a non-cytotoxic protease (or 
a fragment thereof) and optionally a Translocation Domain to 
form a conjugate of the present invention; and/or 

(B) confirming that said putative agonist molecule or agonist 
binds to the receptor on the pain-sensing target cell, which 
receptor is susceptible to receptor-mediated endocytosis; 
and/or 

(C) confimriing that said putative agonist molecule or agonist is 
able to deliver a non-cytotoxic protease (or fragment thereof) 
into the cytosol of a pain-sensing target cell. 

The above steps (A)-(C) may be confirmed by routirte tests that would be readily 
available to a skilled person . 
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For example, step (A) may be perfomied by a simple chemical conjugation 
experiment using conventional conjugation reagents and/or linl<er molecules, 
followed by native polyacrylamide gel electroplioresis to confimn tliata conjugate of 
-tlie-present-invention'is-fomFied-ttiat has the anticipated molecular weight. The 
5 conjugate components are typically linked together (optionally via linlcer molecules) 
by covalent bonds. 

For example, step (B) may be performed by any one of a range of methodologies for 
assessment of binding of a ligand. Standard text, for example "Receptor-Ligand 

10 Interactions. A Practical Approach. Ed. E. C. Hulme. IRL Press, 1 992" are available 
that describe such approaches in detail. In brief, the agonist or putative agonist 
molecule Is labelled (for example, with 125-iodine) and applied to a cell preparation 
in vitro in the presence of an excess of unlabelled agonist. The purpose of the 
unlabelled material is to saturate any non-specific binding sites. The agonist Is 

15 - incubated with the cell preparation for sufficient time to achieve equilibrium, and the 
amount of label bound to the cells assessed by measuring cell associated 
radioactivity, for example by scintillation or gamma counting. 

A further example involves gold-labelling of the agonist (or putative agonist), 
20 followed by the use of electron microscopy to monitor the cellular transport progress 
of the labelled agonist [see the basic methodology described by Rabinowitz S. 
(1992); J. Cell. Biol. 116(1): pp. 95-112; and that described by van Deurs (1986); J. 
Cell. Biol. 102: pp. 37-47]. 

25 For example, step (C) may be perfomied by contacting the conjugate prepared in 
step (A) with a suitable target cell and assessing cleavage of the substrate. This is 
performed by extraction of the SNARE proteins, followed by Western blotting of 
SDS-PAGE-separated samples. Cleavage of substrate Is indicative of delivery of 
the protease into the target cell. In this regard, cleavage may be monitored by 

30 disappearance of substrate and/or appearance of cleavage product. A particularly 
useful antibody that selectively binds to the cleaved substrate product is described 
in WO95/33850. • 
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Preparation of a conjugate according to the present invention is now discussed. 

■ It is known in the art that the He portion of a neurotoxin molecule can be removed 
5 from the other portion of the H-chain, known as Hn, such that the Hn fragment 
remains disulphide linked to the L-chain of the neurotoxin providing a fragment 
known as LHn. Thus, in one embodiment of the present invention the LHn fragment 
of a neurotoxin is covalently linked, using linkages which may include one or more 
spacer regions, to a TM. 

10 

in another embodiment of the invention, the He domain of a neurotoxin is mutated, 
blocked or modified, e.g. by chemical modification, to reduce or preferably 
incapacitate its ability to bind the neurotoxin to receptors at the neuromuscular 
junction. This modified neurotoxin is then covalently linked, using linkages which 
15 may include one or more spacer regions, to a TM. 

In another embodiment of the invention, the H-chain of a neurotoxin, in which the He 
.domain is mutated, blocked or modified, e.g. by chemical modification, to reduce or 
preferably incapacitate its native binding ability, is combined with the L-chain of a 
20 different neurotoxin, or another protease capable of cleaving a protein of the 
exocytic fusion apparatus (e.g. IgA protease of N. gonorrhoeae). This hybrid, 
modified neurotoxin is then covalently linked, using linkages which may include one 
or more spacer regions, to a TM. 

25 In another embodiment of the invention, the Hn domain of a neurotoxin is combined 
with the L-chain of a different neurotoxin, or another protease capable of cleaving a 
protein of the exocytic fusion apparatus (e.g. IgA protease of N. gonorrhoeae). This 
hybrid is then covalently linked, using linkages which may include one or more 
spacer regions, to a TM. 

30 

In another embodiment of the invention, the protease (for example the L-chain 
- :Component of a neurotoxin) is covalently linked, using linkages that may include one 
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or more spacer regions, to a TM that can also effect the intemallsation of the 
protease into the cytoplasm of the relevant target cell(s). 

In -another- embodiment of -^he invention, the-protease (for example the L-chain 
5 component of a neurotoxin) Is covalently linked, using linkages which may include 
one or more spacer regions, to a translocation domain to effect transport of the 
protease fragment into the c^osol. 

In use, the domains of a conjugate according to the present invention are associated 
10 with each other. In one embodiment, two or more of the domains may be joined 
together either directly (e.g. by a covalent linkage), or via a linker molecule. 

A variety of different linker/ spacer molecules may be employed in any of the fusion 
proteins of the present invention . Examples of such spacer molecules include those 
15 - illustrated in Figures 31 and 32. Particular mention here is made to GS15, GS20, 
GS25. and Hx27 - see Figures 31 and 32. 

The present inventors have unexpectedly found that non-cytotoxic protease-TM 
conjugates (eg. CPNv/A) may demonstrate an improved binding activity for 

20 nociceptive sensory afferents when the size of the spacer Is selected so that (in use) 
the TM (preferably the C-terminus thereof) and the translocation domain (preferably 
the N-terminus thereof) are separated from one another by 40-105 angstroms, 
preferably by 50-100 angstroms, and more preferably by 50-90 angstroms. In 
another embodiment, the preferred spacers have an amino acid sequence of 1 1-29 

25 amino acid residues, preferably 1 5-27 amino acid residues, and more preferably 20- 
27 amino acid residues. Suitable spacers may be routinely identified and obtained 
according to Crasto, C.J. and Feng, J.A. (2000) IVlay, 13(5), pp. 309-312 - see also 
http;//www.fccc./edu/researoh/labsffenq/limker.htrnl. 



30 Conjugation techniques suitable for use in the present invention have been well 
documented and are routine for a person skilled in the art. 
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The methodology involved In coupling two protein molecules (A and B) together Is 
simple, and is achieved through the use of a cross-linking agent (also known as a 
chemical coupling agent). For example, molecules A and B are separately 
contacted wlth^ a-eposs-linking-agentvwhieh chemlcally-modifies a specific surface 
5 group on each of molecules A and B thereby fomiing derivatlsed molecules A' and 
B'. The modified surface group on molecule A' Is capable of covalently bonding with 
the modified surface group on molecule B'. Thus, the coupling reaction Is completed 
by mixing together the two protein molecules A' and B'. 

10 Chemical conjugation is illustrated by reference to the following embodiments, where 
P = non-cytotoxic protease component, T = translocation component, and TM = 
targeting moiety. 

In one embodiment, a single chain P - T Is prepared, which is then conjugated to a 
15 -TM . In another embodiment, a single chain TM - T (or T - TM) is prepared, which Is 
then conjugated to a P. In a further embodiment, a single chain P - TM (or TM - P) 
is prepared, which is then conjugated to a T. Another particularly prefen-ed 
conjugate has the structure P - TM - T (with an optional protease cleavage site 
between P and TM). 

20 

Where the T and P components are prepared as a single chain polypeptide, a 
protease cleavage site Is typically included between said components. Any protease 
cleavage site may be employed in this regard. 

25 In an alternative embodiment, the three components may be simultaneously or 
sequentially conjugated together. Thus, the conjugation may be a one- or two-step 
process, and may include one or more different coupling agents. 

Chemical coupling agents and aoss-linking agents have been commercially 
30 available for many years. 

• " Example 5 of the*present invention describes in detail the use of one such coupling 
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agent, namely SPDP, to chemically couple two protein molecules (nociceptin, and 
the LHn of botulinum neurotoxin). The two molecules are separately contacted with 
SPDP, and then mixed together to allow covalent conjugation. 

The conjugate described in Example 6 confirms that another coupling agent, 
PDPH/EDAC, or Traut's reagent, may be employed as an altemative coupling agent 
to SPDP. 

SPDP and Traut's reagent are popular and well-documented coupling agents in the 
technical field of protein conjugation chemistry and are presented here simply as two 
examples of a well known class of compounds that may be employed to covalently 
link together the Targeting Moiety component and the clostridial neurotoxin 
component of the conjugate of the present invention. Other suitable agents include 
SMPB, SMCC (succinimidyl4-(N-maleimidomethyl) cyclohexan-1-carboxylate), and 
•tC-SPDP. - • 

In more detail, commercially available members of the well-known coupling agents 
may be used for conjugation purposes to produce a conjugate of the invention. 
Details of such agents can be found in the following publications: 

. Hermanson, G.T. (1996), Bloconjugate techniques, Academic Press; 

Wong, S.S. (1991), Chemistry of protein conjugation and cross-linking, 
CRC Press; 

Thorpe et al (1987), Cancer Res, 1987, 47. 5924-31. This paper 
describes the use of SMBT (sodium S-4-succinimidyloxycarbonyl- 
alpha-methyl benzyl thiosulfate) and SMPT (4- 
succinimidyloxycarbonyl-alpha-methyl-alpha(2-pyridyldithio)toluene); 

and 

)f.-v/ peetem et al- (1S89)i J Immunol Methods. 1989. 120, 133^3. This . ' 
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paper describes the use of 4 coupling reagents, MHS (succinimidyl 6- 
(N-maleimido)-n-hexanoate), SMCC (succinimidyl 4-(N- 
maleimidometiiyl)-cyGlohexane-1-carboxylate), MBS (succinimidyl m- 
-maleimidobenzoat©), and SPDPr- 

The conjugates according to the present invention may also be prepared 
recombinantly, as detailed in Examples 9 to 12. 

In one embodiment, the preparation of a recombinant conjugate involves 
an-angement of the coding sequences of a selected TM, a selected non-cytotoxic 
protease component, and a translocation component (in any order) in a single 
genetic construct. These coding sequences may be an^nged in-frame so that 
subsequent transcription and translation Is continuous through both coding 
sequences and results in a fusion protein. All constmcts would have a 5' ATG codon 
to encode an N-terminal methionine, and a C-temninal translational stop codon. 

Thus, the recombinant preparation method results in the generation of a single chain 
polypeptide. In order to activate this polypeptide, a protease cleavage site is present 
between the non-cytotoxic protease component and the translocation component. 
Cleavage of this site generates a di-chain polypeptide in which the protease and 
translocation domains are linked together by way of a covalent bond, preferably a 
disulphide bond. In this regard, any protease cleavage site may be employed. 

In the single polypeptide aspect of the present invention, the TM is preferably either 
N- or C-terminally located with respect to the fusion protein. In other words, it is 
preferred that the TM is not located between the P and T components of the single 
polypeptide fusion protein. In a particularly preferred embodiment, the TM is N- 
tenmlnally located with respect to the fusion protein. 

In one embodiment, an L-chain of a clostridial neurotoxin or another protease 
capable of cleaving a protein of the exocytic fusion apparatus (e.g. an IgA protease), 
or a^fragme^t/variant thereof, may be^^expressedrecombinantly as a fusion protein 
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with a TM, which TM can also effect the internalisation of the L-chain component 
into the cytoplasm of the relevant target cell(s) responsible for secretion. 
Alternatively, the fusion protein may further comprise a Translocation Domain. The 
expressed fusion protein may include one or more spacer regions. 

By way of example, the following infomnation is required to produce, recombinantly, 
an agent of the present invention: 

(I) DNA sequence data relating to a selected TM; 

(II) DNA sequence data relating to the protease component; 

(III) DNA sequence data relating to the translocation domain; and 

(IV) a protocol to permit construction and expression of the 
construct comprising (I), (II) and (III). 

All of the above basic infomiation (l)-(IV) are either readily available, or are readily 
detemiinable by conventional methods. For example, both WO98/07864 and 
WO99/17806 exemplify recombinant technology suitable for use In the present 
application. 

In addition, methods for the construction and expression of the constructs of the 
present invention may employ Information from the following references and others: 

Lorberboum-Galski. H., FitzGerald, D., Chaudhary, V., Adhya. S., 
Pastan, I. (1988), Cytotoxic activity of an interleukin 2-Pseudomonas 
exotoxin chimeric protein produced in Escherichia coli. Proc.Natl. 
Acad. Scl. USA, 85(6):1922-6; 

Murphy, J.R. (1988), Diphtheria-related peptide homione gene 
fusions: a molecular genetic approach to chimeric toxin development. 
Cancer Treat. Res.; 37:123-40; 

... ..yv.i}liams, .D.P„..Rad<er, X.,- .B^icharrP.-; Jiishai. ^W^..- Borowski, M.. . 
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Genbauffe, F., Strom, T.B., Murphy. J.R. (1987), Diphtheria toxin 
receptor binding domain substitution with interleul<in-2: genetic 
construction and properties of a diphtheria toxin-related interleukin-2 
-fusion protein; Protein Eng;1(6):493-8; 

Arora, N., WiHiamson, L.C., Leppla, S.H., Halpem, J.L. (1994), 
Cytotoxic effects of a chimeric protein consisting of tetanus toxin light 
chain and anthrax toxin lethal factor in non-neuronal cells J. Biol. 
Chem., 269(42):261 65-71; 

Brinkmann, U., Reiter, Y.. Jung, S.H.. Lee, B.. Pastan, I. (1993), A 
recombinant immunotoxin containing a disulphide-stabilized Fv 
fragment Proc. Natl. Acad. Sci. USA, 90(16):7538-42; and 

O'Hare, M.r Brown, A.N., Hussain, K., Gebhardt, A., Watson, G., 

Roberts, L.M., Vitetta, E.S., Thorpe, P.E., Lord. J.M. (1990), 
Cytotoxicity of a recombinant ricin-A-chain fusion protein containing a 
proteolytlcally-cleavable spacer sequence. FEBS Lett Oct 29;273(1 - 
2):200^. 

Suitable clostridial neurotoxin sequence infonnation relating to L- and LHrrchains 
may be obtained from, for example, Kurazono, H. (1992) J. Biol. Chem., vol. 267, 
No. 21, pp.14721 -14729; and Popoff, M.R., and Marvaud, J.-C. (1999) The 
Comprehensive Sourcebool< of Bacterial Protein Toxins, 2nd edition (ed. Alouf, J.E., 
and Freer, J.H.), Academic Press, pp.1 74-201. 

All of the aforementioned publications are hereby incorporated into the present 
specification by reference thereto. 

Similarly, suitable TM sequence data are widely available in the art. Alternatively, 
any necessary sequence data may be obtained by techniques which are well-known 
to the skillea person?'- ' v T^^^^^^ -r--^'^- ' , 
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For example, DNA encoding the TM component may be cloned from a source 
organism by screening a cDNA library for the correct coding region (for example by 
-using specific oligonucleotldes-based orr the known sequence information to probe 
5 the library), Isolating the TM DNA, sequencing this DNA for confimiatlon purposes, 
and then placing the isolated DNA In an appropriate expression vector for 
expression in the chosen host. 

As an alternative to isolation of the sequence from a library, the available sequence 
10 information may be employed to prepare specific primers for use in PGR, whereby 
the coding sequence is then amplified directly from the source material and, by 
suitable use of primers, may be cloned directly into an expression vector. 

Another altemative method for Isolation of the coding sequence Is to use the existing 
15 sequence information and syntheslse a copy, possibly incorporating alterations, 
using DNA synthesis technology. For example, DNA sequence data may be 
generated from existing protein and/or RNA sequence infonnatlon. Using DNA 
synthesis technology to do this (and the altemative described above) enables the 
codon bias of the coding sequence to be modified to be optimal for the chosen 
20 expression host. This may give rise to superior expression levels of the fusion 
protein. 

Optimisation of the codon bias for the expression host may be applied to the DNA 
sequences encoding the TM and clostridial components of the construct. 
25 Optimisation of the codon bias Is possible by application of the protein sequence into 
freely available DNA/proteIn database software, e.g. programs available from 
Genetics Computer Group, Inc. 

Having prepared a conjugate of the invention, it Is a matter of routine to conflmi that 
30 the various domains have retained their specified function. 



V w-o^^. rprotease'functioh after conjugation may be ^t^^^ . > 
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the following routine tests: 

SNAP-25 (or synaptobrevin, or syntaxin) may be challenged with a conjugate to be 
testedr -and then- analysed by SDS-PAGE peptide separation techniques. 
5 Subsequent detection of peptides (e.g. by silver staining) having molecular weights 
conresponding to the cleaved products of SNAP-25 (or other component of the 
neurosecretory machinery) would confinn the presence of a functional L-chain. 

As a further altemative, the conjugate may be tested by assaying for SNAP-25 (or 
10 synaptobrevin, or syntaxin) cleavage products via antibody-specific binding (see 
WO95/33850). In more detail, a specific antibody Is employed for detecting 
cleavage of SNAP-25. Since the antibody recognises cleaved SNAP-25, but not 
uncleaved SNAP-25, identification of the cleaved product by the antibody conflmis 
the presence of L-chain proteolytic function. By way of exemplification, such a 
15 method is described in Examples 2 and 3 of W096/33273. 

Translocation component function after conjugation may be tested using, for 
example, any one of the following routine tests: 

20 Suitable methods are, for example, described by Shone et al. (1987) Eur. J. 
Biochem. 1 67, pp.1 75-1 80; and by Blaustein et al. (1 987) FEBS 226 (1 ), pp.1 1 5- 
120. 

The Shone et al. method employs artificial liposomes loaded with potassium 
25 phosphate buffer (pH 7.2) and radlolabelled NAD. Release of K+ and NAD from the 
liposomes correlates with a positive result for channel fomiing activity and hence 
translocation activity. In this regard, K+ release from liposomes may be measured 
using an electrode and NAD release calculated by measuring the radioactivity in the 
supernatant (see page 176, column 1, line 33 - column 2, line 17). 

30 

The Blaustein etal. method employs planar phospholipid bilayer membranes, which 
f.; ?«;;areaseSl'to testfor channelfonTi activity, lamore detail, saltsojutions on either 1 .^^^ 
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Side of the membrane are buffered at a different pH - on the cis side, pH 4.7 or 5.5 
and on the trans side, pH 7.4. The "conjugate" to be tested Is added to the cis side 
of the membrane and electrical measurements are made under voltage clamp 
--eonditionsT-in order to-monitop-the flow of-eun-ent across the membrane (see 
5 paragraph 2.2, pages 116-11 8). The presence of an active translocation function is 
confirmed by a steady rate of channel turn-on (i.e. a positive result for channel 
fonnation) -see paragraph 3, page 118. 

Targeting Moiety (TM) function after conjugation may be tested by assaying for the 
10 agonist function inherent to the TM. Suitable methods include those described in 
Example 1. 

The ability of the conjugate of the invention to inhibit substance P release from 
nociceptive afferent cells can be assessed using the methods described in 
15 -Example 15. 

In Example 15, a nociceptin-LHN/A conjugate according to the first aspect of the 
Invention Is assessed for its ability to inhibit the release of substance P from primary 
nociceptive sensory afferent neurons. As can be seen from Table 1 , incubation of 
20 the conjugate with cultures of nociceptive afferent neurons results in a significant 
Inhibition of release of substance P (when compared to incubation of the cells with 
LHN/A alone). The experiment therefore confirms that the conjugate is inhibiting 
substance P release from these cells. 

25 In use of the present invention, a pain-sensing target cell is selected in which it is 
desired to reduce or inhibit the process of exocytic fusion, which exocytic process 
contributes to the symptoms associated with the sensation of pain. For example, the 
target cell in question may demonstrate an undesirable phenotype (e.g. an 
undesirable secretion, or the expression of an undesirable concentration of 

30 membrane receptor, transporter or membrane channel), which contributes to the 
symptoms associated with pain. Alternatively, a target cell may be selected in which 
"ft vrvr«'the'iptocessofiex6cyticfusion.contri^ -.yr 
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In preferred embodiments of the invention, the target cell is a nociceptive sensory 
afferent cell, preferably a primary nociceptive afferent cell (e.g. an A-fibre such as 
anA6-fibre or a G-fibre>;-Thus, the conjugates of the present Invention are capable 

5 of Inhibiting neurotransmitter or neuromodulator (e.g. glutamate, substance P, 
calcitonin-gene related peptide (CGRP), and/or neuropeptide Y) release from 
discrete populations of nociceptive sensory afferent neurons. In use, the conjugates 
reduce or prevent the transmission of sensory afferent signals (e.g. 
neurotransmitters or neuromodulators) from peripheral to central pain fibres, and 

10 therefore have application as therapeutic molecules for the treatment of pain, in 
particular chronic pain. 

It is routine to confirm that a TM binds to a nociceptive sensory afferent. For 
example, a simple radioactive displacement experiment may be employed in which 

15 tissue or cells representative of the nociceptive sensory afferent (for example 
DRGs) are exposed to labelled (e.g. tritiated) ligand in the presence of an excess of 
unlabelled ligand. In such an experiment, the relative proportions of non-specific 
and specific binding may be assessed, thereby allowing confimriation that the ligand 
binds to the nociceptive sensory afferent target cell. Optionally, the assay may 

20 Include one or more binding antagonists, and the assay may further comprise 
observing a loss of ligand binding. Examples of this type of experiment can be 
found in Hulme, E.G. (1990), Receptor-binding studies, a brief outline, pp 303-31 1 , 
In Receptor biochemistry, A Practical Approach, Ed. E.G. Hulme, Oxford University 
Press. 

25 . 

According to a second aspect, the present invention provides a non-cytotoxic 
conjugate for inhibition or reduction of exocytotic fusion in a nociceptive sensory 
afferent cell, comprising: 

30 (i) a Targeting Moiety (TM), 

r , v. - . . Wherein said.TM.is #n .agonist-qf ateceptor th^t is pi;esent on - , j . 



wo 2006/059105 



PCT/GB2005/004598 



29 

said nociceptive sensory afferent cell, and wherein said 
receptor undergoes endocytosis to be incorporated Into an 
endosome within the nociceptive sensory afferent cell; 

5 (ii) a DNA sequence encoding a non-cytotoxic protease or a 

fragment thereof, 

wherein the DNA sequence is expressible in the nociceptive 
sensory afferent ceil and when so expressed provides a 
protease or protease fragment capable of cleaving a protein of 
the exocytic fusion apparatus of said nociceptive sensory 
afferent cell; and 

a Translocation Domain, 

wherein the Translocation Domain translocates the DNA 
sequence encoding the protease or protease fragment from 
within the endosome, across the endosomal membrane, and 
into the nociceptive sensory afferent cell. 

In a prefenred embodiment, the receptor is an ORLi receptor. 

DNA encoding a protein of interest can be transfected into eukaryotic cells through 
receptor-mediated endocytosis of a protein-DNA conjugate, as confirmed by Cotton 

25 ef a/. (Cotton, M., Wagner, E. and Birnstiel, L. (1 993) Receptor-mediated transport of 
DNA into eukaryotic cells. Methods in Enzymol. 217, 619-645). Several methods 
exist for condensing DNA to a suitable size using polycationic ligands. These 
. include: polylysine, various cationic peptides and cationic liposomes. Of these, 
polylysine was used in the present study because of its successfully reported use In 

30 receptor-mediated transfection studies (Cotton ef a/., 1 993). 

c , ~ . • 'The DNA-sequenc6'~en(^ding 'the'n0n-cytotoxier protease; may be 



(ili) 

15 
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expressed under the control of an operably linked promoter present as part of the 
agent (e.g. as part of the protease DNA sequence upstream of the coding region). 
Altematively, expression of the protease component in the target cell may rely on a 
promoteppresent-in the target cell: 

5 

The DNA sequence encoding the protease component may integrate into a DNA 
sequence of the target cell. One or more integration site(s) may be provided as part 
of the conjugate (e.g. as part of the protease DNA sequence). 

10 The TM, Translocation Domain and protease components of this second aspect of 
the invention are as defined for the first aspect of the invention. Examples 1 3 and 
14 describe the preparation of conjugates according to the second aspect of the 
invention. 

15 According to a third aspect, the present invention provides a phamiaceutical 
composition comprising a conjugate according to the first and/or second aspect of 
the present invention. 

The phamnaceutical composition may further comprise a phamiaceutically- 
20 acceptable canrier, and/or a suitable diluent and/or excipient, although the exact fomi 
of the composition may be tailored to the mode of administration. Administration is 
preferably to a mammal, more preferably to a human. 

The components of the composition may, for example, be employed In the form of 
25 an aerosol or nebulisable solution for inhalation or a sterile solution for parenteral 
administration, Intra-artlcular administration or intra-cranlal administration. 

The composition may also be administered by i.v. injection, which includes the use 
of pump systems. Spinal injection (e.g. epidural or Intrathecal) or indwelling pumps 
30 may also be used. 

- i r. vTfiedbsa'ge rah^eis fopadministration of tliexwmponents ofthe present invention are .r ^ . . 
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those to produce the desired therapeutic effect. It will be appreciated that the 
dosage range required depends on the precise nature of the components, the route 
of administration, the nature of the formulation, the age of the patient, the nature, 
extent or severity of the patient's condition, contraindications, if any, and the 
5 judgement of the attending physician. 

Suitable dally dosages (for each component) are in the range 0.0001-1 mg/kg, 
preferably 0.0001-0.5 mg/kg, more preferably 0.002-0.5 mg/kg, and particularly 
preferably 0.004-0.5 mg/kg. The unit dosage can vary from less that 1 microgram to 
10 30 mg, but typically will be in the region of 0.01 to 1 mg per dose, which may be 
administered dally or preferably less frequently, such as weekly or six monthly. 

A particularly preferred dosing regimen is based on 2.5 ng of fusion protein (e.g. 
CPNv/A) as the IX dose. In this regard, prefenred dosages are In the range IX- 

15 1 0OX (I.e. 2.5-250 ng). This dosage range is significantly lower (i.e. at least 10-fold, 
typically 100-fold lower) than would be employed with other types of analgesic 
molecules such as NSAIDS, morphine, and gabapentin. Moreover, the above- 
mentioned difference is considerably magnified when the same comparison is made 
on a molar basis - this is because the fusion proteins of the present Invention have a 

20 considerably greater Mw than do conventional 'small' molecule therapeutics. 

Wide variations In the required dosage, however, are to be expected depending on 
the precise nature of the components, and the differing efficiencies of various routes 
of administration. For example, oral administration would be expected to require 
25 higher dosages than administration by intravenous injection. 

Variations In these dosage levels can be adjusted using standard empirical routines 
for optimisation, as is well understood In the art. 

30 Compositions suitable for injection may be in the form of solutions, suspensions or 
emulsions, or dry powders which are dissolved or suspended in a suitable vehicle 
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Fluid unit dosage forms are typically prepared utilising a pyrogen-free sterile vehicle. 

-The-aetive ingredients, depending on the vehicle and concentration used, can be 
5 either dissolved or suspended in the vehicle. 

Solutions may be used for allfomns of parenteral administration, and are particularly 
used for intravenous injection. In preparing solutions the components can be 
dissolved in the vehicle, the solution being made isotonic if necessary by addition of 
10 sodium chloride and sterilised by filtration through a sterile filter using aseptic 
techniques before filling Into suitable sterile vials or ampoules and sealing. 
Altematively, if solution stability is adequate, the solution in Its sealed containers 
may be sterilised by autoclaving . 

15 Advantageously additives such-as buffering, solubillsing. stabilising, preservative or 
bactericidal, suspending or emulsifying agents and/or local anaesthetic agente may 
be dissolved in the vehicle. 

Dry powders which are dissolved or suspended in a suitable vehicle prior to use may 
20 be prepared by filling pre-sterilised drug substance and other ingredients into a 
sterile container using aseptic technique in a sterile area. 

Altematively the components of the composition may be dissolved In an aqueous 
vehicle, the solution is sterilized by filtration and distributed into suitable containers 
25 using aseptic technique in a sterile area. The product is then freeze-dried and the 
containers are sealed aseptically. 

Parenteral suspensions, suitable for intramuscular, subcutaneous or Intrademnal 
injection, are prepared in substantially the same manner, except that the sterile 
30 components are suspended in the sterile vehicle, instead of being dissolved and 
sterilisation cannot be accomplished by filtration. The components may be isolated 
./ v\-^^i.* "'-^» Tr=-Vrn:a. sterile ste^^ altematively it may be sterilised after isolation, e g. byjarpma.,, t?,,. 
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irradiation. 

Advantageously, a suspending agent for example polyvinylpyrrolidone is included 
-in-the composition(s) to facilitate unifomri distribution of the components. 

Compositions suitable for administration via the respiratory tract include aerosols, 
nebulisable solutions or microfine powders for insufflation. In the latter case, particle 
size of less than 50 microns, eispeclally less than 10 microns, is prefenred. Such 
compositions may be made up in a conventional manner and employed in 
conjunction with conventional administration devices. 

The compositions described in this invention can be used in vivo, either directly or as 
a pharmaceutically acceptable salt, for the treatment of conditions involving 
exocytosis (for example secretion, or the delivery of proteins such as receptors, 
transporters, and membrane channels to the plasma membrane of a cell). 

According to a fourth aspect, the present invention provides a DNA construct that 
encodes a conjugate according to the first or second aspects of the invention. 

By expressing the constmct in a host cell, conjugates of the invention may be 
prepared. 

According to a fifth aspect, the present invention provides a method of treatment of 
pain by administration to a patient of a conjugate, composition, or construct 
according to the first to fourth aspects of the invention, or any combination thereof. 

In a preferred embodiment, the invention provides a method.of treating chronic pain. 

According to a sixth aspect, the present invention provides for the use of a 
conjugate, composition or construct according to the first to fourth aspects of the 
invention, for the manufacture of a medicament for treating pain, preferably chronic 
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Definitions Section 

Exee^ic fusion-is-a proeess-by which IrvtFaeellular molecules are transported from 
the cytosol of a pain-sensing target cell to the plasma (i.e. cell) membrane thereof. 
Thereafter, the intracellular molecules may become displayed on the outer surface of 
the plasma membrane, or may be secreted into the extracellular environment. 

In a healthy individual, the rate of exocytic fusion is carefully regulated and allows 
control of the transport of molecules between the cytosol and the plasma membrane 
of a pain-sensing cell. For example, regulation of the exocytic cycle allows control of 
the density of receptors, transporters, or membrane channels present at the cell's 
surface, and/or allows control of the seaetion rate of Intracellular components (e.g. 
neurotransmitters) from the cytosol of the cell. 

However, in an unhealthy Individual, the regulation of exocytic fusion may be 
modified. For example, exocytic fusion may cause affected pain-sensing cells to 
enter a state of hypersecretion. Alternatively, exocytic fusion may result in the 
display of an increased concentration of receptors, transporters, or membrane 
channels present on the surface of the pain-sensing, which may expose the cell to 
undesirable external stimuli. Thus, the process of exocytic fusion may contribute to 
the progression and/or severity of pain, and therefore provides a target for 
therapeutic intervention. 

It should also be appreciated that otherwise normal rates of cellular exocytic fusion 
may contribute to the progression and severity of pain in compromised patients. 
Thus, by targeting exocytic fusion in accordance with the present invention. It is also 
. possible to provide therapy in such patients 

Targeting Moiety (TM) means any chemical stmcture associated with a conjugate 
that functionally interacts with a receptor, e.g. an ORLi receptor, to cause a physical 
> association between the corijugafe and the surface of a .pain^sensing target cell. 
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The term TM embraces any molecule (i.e. a naturally occurring molecule, or a 
chemically/physically modified variant thereof) that is capable of binding to a 
receptor on the target cell, which receptor is capable of internalisation (e.g. 
endosome^miatlon)-also^«ferredHo as-receptor-mediated endocytosis. The TM 
5 may possess an endosomal membrane translocation domain, in which case 
separate TM and Translocation Domain components need not be present in an 
agent of the present invention. 

The term "fragment" means a peptide having at least thirty-five, preferably at least 
10 twenty-five, more preferably at least fifteen, and most preferably at least ten amino 
acid residues of the TM in question. In one embodiment, the first amino acid residue 
of the fragment is the N-temninal amino acid residue of the TM from which the 
fragment has been derived. 

15 • An-example of a "variant"-is a peptide orpeptide fragment of a TM that contains one 
or more analogues of an amino acid (e.g. an unnatural amino acid), or a substituted 
linkage. 

A "derivative" comprises the TM In question, and a further peptide sequence. The 
20 further peptide sequence should preferably not interfere with the basic folding and 
thus conformational structure of the TM. Two or more peptides (or fragments, or 
variants) may be joined together to fomn a derivative. Alternatively, a peptide (or 
fragment, or variant) may be joined to an unrelated molecule (e.g. a second, 
unrelated peptide). Derivatives may be chemically synthesized, but will be typically 
25 prepared by recombinant nucleic acid methods. Additional components such as 
lipid, and/or polysaccharide, and/or polyketide components may be included. 

The term non-cytotoxic means that the protease molecule in question does not kill 
the pain-sensing target cell to which it has been re-targeted. 

30 

The "protease cleavage site" of the present invention allows cleavage (preferably 
' • cantrblled cleavage) of thB xofljtsgate at arposition. between~the non-cytotoxic 
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protease component and the TM component. In one embodiment, the conjugate 
may include more than one proteolytic cleavage site. However, where two or more 
such sites exist, they are different, thereby substantially preventing the occurrence of 
- fnultiple-cleavage~«vepits-in- the-presence -of a single protease. In another 

5 embodiment, it is prefen-ed that the conjugate has a single protease cleavage site. 
The protease cleavage sequence(s) may be Introduced (and/or any Inherent 
cleavage sequence removed) at the DNA level by conventional means, such as by 
site-directed mutagenesis. Screening to confimn the presence of cleavage 
sequences may be performed manually or with the assistance of computer software 

10 (e.g. the MapDraw program by DNASTAR, Inc.). 

Whilst any protease cleavage site may be employed, the following are prefenred: 



Enterokinase (DDDDK4) 

Factor Xa (lEGRi / IDGRi) 

TEV(Tobacco Etch virus) (ENLYFQiG) 

Thrombin (LVPRiGS) 

PreScission (LEVLFQiGP). 



20 Also embraced by the temi protease cleavage site is an intein, which Is a self- 
cleaving sequence. The self-splicing reaction is controllable, for example by varying 
the concentration of reducing agent present. 

The present invention is now described by reference to the following Examples and 
25 Figures, without intended limitation thereto. 



Figures 

Figure 1 Expression and purification of recLHw/B fusion protein 

30 Figure 2 Expression and purification of LHn/C fusion protein 

Figure 3 Expression and purification of N[1 -1 7]-LHn/A fusion protein 
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Figure 5 Purification of a nociceptin-LC/A-HN/A fusion protein 
Figure 6 Purification of a LC/C-nociceptin-HN/C fusion protein 
Figure 7 Purification of a LC/A-met enl<eplialin-HN/A fusion protein 

-Rgur«-8 Gomparisen of-binding effieaey-of a-LG/A-nociceptin-HN/A fusion 

5 protein and a nodceptln-LC/A-HN/A fusion protein 

Figure 9 In vitro catalytic activity of a LC/A-nociceptin-HN/A fusion protein 
Figure 1 0 Purification of a LC/A-nociceptin variant-HN/A fusion protein 
Figure 11 Comparison of binding efficacy of a LC/A-nociceptin-HN/A fusion 
protein and a LC/A-nociceptin variant-Hisj/A fusion protein 
10 Figure 1 2 Expressed / purified LC/A-nociceptln-HN/A fusion protein family witli 
variable spacer length product(s) 
Figure 1 3 Inhibition of SP release and cleavage of SNAP-25 by CPN-A 
Figure 1 4 Inhibition of SP release and cleavage of SNAP-25 over extended 
time periods after exposure of DRG to CPN-A 
15 -Figure 15 Cleavage ofSNAP-25 by CPNv-A • 

Figure 1 6 Cleavage of SNAP-25 over extended time periods after exposure of 
DRG to CPNv-A 

Figure 17 CPNv-A fusion-mediated displacement of [3H]-nociceptin binding 
Figure 1 8 Expressed / purified CPNv(Ek)-A product 
20 Figure 1 9 Cleavage of SNAP-25 by CPN v(Ek)-A 
Figure 20 Expressed / purified CPNv-C product 
Figure 21 Cleavage of syritaxin by CPNv-C 

Figure 22 CPN-A efficacy in the Acute Capsaicin-lnduced Mechanical Allodynia 
model 

25 Figure 23 CPN-A efficacy in the Streptozotocin (STZj-lnduced Peripheral 
Diabetic Neuropathy (Neuropathic Pain) model 
Figure 24 CPNv-A efficacy in the Acute Capsaicin-lnduced Mechanical Allodynia 
model 

Figure 25 Expressed / purified LC/A-CPLE-Hn/A product 
30 Figure 26 Expressed / purified LC/A-CPBE-Hn/A product 
Figure 27 Expressed / purified CPOP-A product 

^r^^- V iC'. ■ '^^Bi§ar©<28: -c-Expressed/purtftesJ CPOPv-Aproduct . . : .. ,. .. . ^ w « " "■'.^ ■;siiifiis: 
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Figure 29 In vitro SNAP-25 cleavage In a DRG cell model 

Figure 30 Expressed / purified CPNv-A-FXa-HT (removable his-tag) 

Figure 31 In vitro efficacy of LC/A-nociceptin-HN/A fusion proteins with variable 

spacer length, as assessed by ligand competition assay 

Figure 32 In vitro efficacy of LC/A-nociceptin-HN/A fusion proteins with variable 
spacer length, as assessed by in vitro SNAP-25 cleavage 



The Figures are now described in more detail. 



10 Figure 1 •> Expression and purification of reclHtJB fusion protein 



SDS-PAGE analysis of expression and purification of recLHulB from E. coll. In 
Flgurel, /©cLHn/B is purified from cell paste using a three column strategy as 
described in Example 3. Protein samples are separated by SDS-PAGE and 

15 visualised by staining with simplyblue safestain coomassie reagent. Crude, soluble 
MBP-LHn/B fusion protein contained within the clarified extract (lane 2) is loaded 
onto Q-Sepharose FF anion-exchange resin. Lane 3 represents recombinant MBP- 
LHn/B fusion eluted from column at 150-200 mM salt. This sample is treated with 
factor Xa protease to remove MBP affinity tag (lane 4), and cleaved mixture diluted 

20 to lower salt concentration prior to loading onto a Q-Sepharose FF anion-exchange 
column. Material eluted between 120-170 mM salt was rich in LHn/B (lane 5). 
Protein in lanes 6 and 8 represents LHn/B harvested after treatment with 
enterokinase and final purification using Benzamidine Sepharose. under non- 
reducing and reducing conditions respectively. Lanes 1 and 7 represent molecular 

25 mass markers [Mark 12 (Invitrogen)]. 

Figure 2 - Expression and purification of LHn/C fusion protein 



SDS-PAGE analysis of expression and purification of LHn/C from £. coll. In Figure 
30 2, recLHu/C is purified from E. coli cell paste using a two-step strategy described in 
Example 4. Protein samples are separated by SDS-PAGE and visualised by 
a'iC'ir .<"^stainingiwith:coomassle^^^ --'^r-TA"^;-. 
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Sepharose FF anion-exchange resin. Fusion protein, I\/IBP-LHn/C is eluted with 0.1 
U NaCI (lane 3). Eluted material incubated at 22°C for 16 h with factor Xa protease 
(New England Biolabs) to cleave fusion tag MBP and nick redHnlC at the linl<er site. 
-The proteinof interest is furtheriaurif led from cleaved fusion products (lane 4) using 
5 Q-Sepharose FF. Lanes 5 and 7 show purified redHnlC under non-reducing 
conditions and reduced with 1 0 mM DTT respectively, to illustrate disulphlde bonding 
at the linker region between LC and Hn domains after nicking with factor Xa. Lanes 
1 and 6 represent molecular mass markers (shown in KDa); Mark 12 (Invitrogen). 

10 Figure 3 - Expression and purification of N[1 -1 7]-LHn/A fusion protein 

SDS-PAGE analysis of expression and purification of N[1-1 71-LHN/Afrom E. coli. In 
Figure 3, N[1-17]-LHn/A Is purified from £ coli BL21 cell paste using the 
methodology outlined in Example 9. Briefly, the soluble products obtained following 

15 cell dismption were applied to a nickel-charged affinity capture column. Bound 
proteins were eluted with 1 00 mM imidazole, treated with Factor Xa to activate the 
fusion protein and remove the maltose-binding protein (MBP) tag, then re-applied to 
a second nickel-charged affinity capture column. Samples from the purification 
prtxjedure were assessed by SDS-PAGE (Panel A) and Westem blotting (Panel B). 

20 Anti-nociceptin antisera (obtained from Abeam) were used as the primary antibody 
for Westem blotting. The final purified material in the absence and presence of 
reducing agent is identified in the lanes mari^ed [-] and [+] respectively. 

Figure 4 - Purification of a LC/A-nociceptin-IHN/A fusion protein 

25 

Using the methodology outlined .in Example 26, a LC/A-nociceptin-HN/A fusion 
protein was purified from E. coli BL21 cells. Briefly, the soluble products obtained 
. following cell disruption were applied to a nickel-charged affinity capture column. 
Bound proteins were eluted with 100 mM Imidazole, treated with Factor Xa to 
30 activate the fusion protein and remove the maltose-binding protein (MBP) tag, then 
re-applied to a second nickel-charged affinity capture column. Samples from the 
/ ' A -^cnc; *^^^^ by SDS-RAGE^Pan©l fii)m^ Westem.blottiog ..r" rr;: 
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(Panel B). Anti-nociceptin antisera (obtained from Abeam) were used as the primary 
antibody for Western blotting. The final purified material in the absence and 
presence of reducing agent is identified in the lanes marked [-] and [+] respectively. 



5 Figure 5 - Purification of a nociceptin-LC/A-HN/A fusion protein 

Using the methodology outlined in Example 26, a nociceptin-LC/A-HN/A fusion 
protein was purified from E coli BL21 cells. Briefly, the soluble products obtained 
following cell disruption were applied to a nickel-charged affinity capture column. 

10 Bound proteins were eluted with 100 mM imidazole, treated with Factor Xa to 
activate the fusion protein and remove the maltose-binding protein (MBP) tag, then 
re-applied to a second nidcel-charged affinity capture column. Samples from the 
purification procedure were assessed by SDS-PAGE (Panel A) and Westem btatting 
(Panel B). Anti-nociceptin antisera (obtained from Abeam) were used as the primary 

15 -antibody for Western- blotting; The final purified material in the absence and 
presence of reducing agent is identified in the lanes mari<ed [-] and [+] respectively. 



Figure 6 - Purification of a LC/C-nociceptin-IHN/C fusion protein 

20 Using the methodology outlined in Example 26, an LC/C-nociceptin-HN/C fusion 
protein was purified from.E coli BL21 cells. Briefly, the soluble products obtained 
following cell disruption were applied to & nickel-charged affinity capture column. 
Bound proteins were eluted with 100 mM imidazole, treated with Factor Xa to 
activate the fusion protein and remove the maltose-binding protein (MBP) tag, then 

25 re-applied to a second nickel-charged affinity capture column. Samples from the 
purification procedure were assessed by SDS-PAGE (Panel A) and Westem blotting 
(Panel B). Anti-nociceptin antisera (obtained from Abeam) were used as the primary 
antibody for Western blotting. The final purified material in the absence and 
presence of reducing agent is identified in the lanes marked [-] and [+] respectively. 

30 
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Figure 7 - Purification of a LC/A-met enkephalin-HN/A fusion protein 

-Using-theinethodology outlined in-Example 26-, an LC/A-met enkephalin-HN/A fusion 
5 protein was purified from E. coli BL21 cells. Briefly, the soluble products obtained 

following cell disruption were applied to a nickel-charged affinity capture column. 

Bound proteins were eluted with 100 mM imidazole, treated with Factor Xa to 

activate the fusion protein and rfemove the maltose-binding protein (MBP) tag, then 

re-applied to a second nickel-charged affinity capture column. Samples from the 
10 purification procedure were assessed by SDS-PAGE. The final purified material in 

the absence and presence of reducing agent is identified in the lanes marked [-] and 

[+1 respectively. 

Figure 8 - Comparison of binding efficacy of a LC/A-nociceptin-HN/A fusion 
15 -protein and a nociceptin-LC/A-HN/A fusion protein 

The ability of nociceptin fusions to bind to the ORLi receptor was assessed using a 
simple competition-based assay. Primary cultures of dorsal root ganglia (DRG) were 
exposed to varying concentrations of test material in the presence of 1 nM [3H]- 
20 nociceptin. The reduction in specific binding of the radiolabelled ligand was 
assessed by scintillation counting, and plotted in comparison to the efficacy of 
unlabelled ligand (Tocris nociceptin). It is clear that the LC/A-nociceptin-HN/A fusion 
is far superior to the nociceptin-LC/A-HN/A fusion at interacting with the ORLi 
receptor. 

25 

Figure 9 • /n vitro catalytic activity of a LC/A-nociceptin-HN/A fusion protein 

The in vitro endopeptidase activity of the purified LC/A-nociceptin-HN/A fusion 
protein was detemiined essentially as described in Chaddock et ai 2002, Prot. 
30 , Express Purif . 25, 21 9.-228. Briefly, SNAP-25 peptide immobilised to an ELISA plate 
was exposed to varying concentrations of fusion protein for 1 hour at 37°C. 
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Following a series of washes, the amount of cleaved SNAP-25 peptide was 
quantified by reactivity with a specific antisera. 

Figure 10 - Purification of a LC/A-nociceptin variant-HN/A fusion protein 

5 

Using the methodology outlined In Example 26, an LC/A-nociceptIn variant-HN/A 
fusion protein was purified from E coli BL21 cells. Briefly, the soluble products 
obtained following cell dismption were applied to a nickel-charged affinity capture 
column. Bound proteins were eluted with 100 mM imidazole, treated with Factor Xa 

10 to activate the fusion protein and remove the maltose-binding protein (MBP) tag, 
then re-applled to a second nickel-charged affinity capture column. Samples from 
the purification procedure were assessed by SDS-PAGE. The final purified material 
In the absence and presence of reducing agent is Identified in the lanes marked [-] 
and [+] respectively. 

15 — .- 

Figure 11 - Comparison of binding efficacy of a LC/A-nociceptin-HN/A fusion 
protein and a LC/A-nociceptin variant-HN/A fusion protein 

The ability of nodceptln fusions to bind to the ORLi receptor was assessed using a 
20 simple competition-based assay. Primary cultures of dorsal root ganglia (DRG) were 
exposed to varying concentrations of test material in the presence of InM [3H]- 
nociceptin. The reduction in specific binding of the radlolabelled ligand was 
assessed by scintillation counting, and plotted in comparison to the efficacy of 
unlabelled ligand (Tocris nociceptin). It is clear that the LC/A-nociceptin variant-HN/A 
25 fusion (CPNv-LHA) is superior to the LC/A-nociceptin variant-HN/A fusion (CPN- 
LHA) at interacting with, the ORLi receptor. 

Figure 12 • Expressed / purified LC/A-nociceptin-HN/A fusion protein family 
witli variable spacer length product(s) 

30 

Using the methodology outlined in Example 26, variants of the LC/A-CPN-Hn/A 
■ fusion- 'consisting of GSIO, GS30 and HX27 are purified from £. coli cell paste. 
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Samples from the purification of LC/A-CPN(GS10)-Hn/A. LC/A-CPN(GS15)-Hn/A. 
LC/A-CPN(GS25)-Hn/A, LC/A-CPN(GS30)-Hn/A and LC/A-CPN(HX27)-Hn/A were 
assessed by SDS-PAGE prior to staining with Coonnassie Blue. The electrophoresis 
-profile-indieates-puf ification^f a-disulphide-bonded di-chain species of the expected 

5 molecular mass of CPBE-A. Top panel: M = benchmark molecular mass markers; S 
= total £. CO// protein soluble fraction; FT = proteins that did not bind to the Ni^*- 
charged Sepharose column; FUSION = fusion protein eluted by the addition of 
imidazole. Bottom panel: Lane 1 = benchmark molecular mass markers; Lane 2 = 
total £. CO// protein soluble fraction; Lane 3 = purified material following initial capture 

10 on Ni^*-charged Sepharose; Lane 4 = Factor Xa treated material prior to final 
capture on Ni^*-charged Sepharose; Lane 5 = purified final material post activation 
with Factor Xa (5 pi); Lane 6 = purified final material post activation with Factor Xa 
(1 0 pi); Lane 7 = purified final material post activation with Factor Xa (20 pi); Lane 8 
= purified final material post activation with Factor Xa + DTT (5 pi); Lane 9 = purified 

15 -final-material post activation with Factor Xa + DTT (10 pi); Lane 10 = purified final 
material post activation with Factor Xa + DTT (20 pi). 

Figure 13 - Inhibition of SP release and cleavage of SNAP-25 by CPN-A 

20 Briefly, primary cultures of dorsal root ganglia (DRG) were exposed to varying 
concentrations of CPN-A for 24 hours. Cellular proteins were separated by SDS- 
PAGE, Westem blotted, and probed with anti-SNAP-25 to facilitate an assessment 
of SNAP-25 cleavage. The percentage of cleaved SNAP-25 was calculated by 
densitometric analysis and plotted against fusion concentration (dashed line). 

25 Material was also recovered for an analysis of substance P content using a specific 
EIA kit. Inhibition of substance P release is illustrated by the solid line. The fusion 
concentration required to achieve 50% maximal SNAP-25 cleavage is estimated to 
. be 6.30±2.48 nM. 

30 
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Figure 14 - Inhibition of SP release and cleavage of SNAP-25 over extended 
time periods after exposure of DRG to CPN-A 

— Pf imaiy- cultur«s-of— dorsal- F0et-ganglia-(BRG)- were -exposed- to varying 
5 concentrations of CPN-A for 24 hours. Botulinum neurotoxin (BoNT/A)wasusedas 
a control. After this initial exposure, extracellular material was removed by washing, 
and the cells incubated at 37°C for varying periods of time. At specific time points, 
cellular proteins were separated by SDS-PAGE, Western blotted, and probed with 
anti-SNAP-25 to facilitate an assessment of SNAP-25 cleavage. The percentage of 
10 cleaved SNAP-25 was calculated by densitometric analysis and illustrated by the 
dotted lines. Material was also recovered for an analysis of substance P content 
using a specific EIA kit. Inhibition of substance P release is illustrated by the solid 
lines. 

15 Figure 1 5 - Cleavage of SNAP-25 by CPNv-A 

Primary cultures of dorsal root ganglia (DRG) were exposed to varying 
concentrations of CPNv-A for 24 hours. Cellular proteins were separated by SDS- 
PAGE, Western blotted, and probed with anti-SNAP-25 to facilitate an assessment 
20 of SNAP-25 cleavage. The percentage of cleaved SNAP-25 was calculated by 
densitometric analysis. The fusion concentration required to achieve 50% maximal 
SNAP-25 cleavage is estimated to be 1 .38±0.36 nM. 

Figure 16 - Cleavage of SNAP-25 over extended time periods after exposure 
25 of DRG to CPNv-A 

Primary cultures of dorsal root ganglia (DRG) were exposed to varying 
concentrations of CPNv-A for 24 hours. CPN-A was used as a control. After this 
initial exposure, extracellular material was removed by washing, and the cells 
30 incubated at 37X for varying periods of time. At specific time points, cellular 
proteins were separated by SDS-PAGE, Western blotted, and probed with anti- 
- ... r -^NAP-25 to facilitate an assessrhent of SNAP-25 cleavage. The percentage, of 
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cleaved SNAP-25 was calculated by densitometric analysis. 

Figure 17 - CPNv-A fusion-mediated displacement of [3H]-nociceptin 
-binding- 

5 

The ability of noclceptln fusions to bind to the ORLi receptor was assessed using a 
simple competition-based assay. Primary cultures of dorsal root ganglia (DRG) were 
exposed to varying concentrations of test material in the presence of 1 nM [SH]- 
nociceptin. The reduction in specific binding of the radlolabelled ligand was 
10 assessed by scintillation counting, and plotted in comparison to the efficacy of 
unlabelled ligand (Tocris nociceptin). It Is clear that the LC/A-nociceptin variant-HN/A 
fusion (labelled as CPNv-LHnA) is superior to the LC/A-nociceptin-HN/A fusion 
(labelled as CPN-LHnA) at Interacting with the ORLi receptor. 

15 Figure 18 - Expressed / purified CPNv(Ek)-A product 

Proteins were subjected to SDS-PAGE prior to staining with Coomassie Blue. The 
electrophoresis profile indicates purification of a disulphide-bonded di-chain species 
of the expected molecular mass of CPNv(Ek)-A. Lane 1 = benchmari^ molecular 

20 mass mari<ers; Lane 2 = total £. coll protein soluble fraction; Lane 3 = purified 
material following Initial capture on Ni^*-charged Sepharose; Lane 4 = purified final 
material post activation with enterokinase (5 pi); Lane 5 = purified final material post 
activation with enterokinase (10 pi); Lane 6 = purified final material post activation 
with enterokinase (20 |jl); Lane 7 = purified final material post activation with 

25 enterokinase + DTT (5 Lane 8 = purified final material post activation with 
enterokinase + DTT (10 pi); Lane 9 = purified final material post activation with 
enterokinase + DTT (20 pi). 

Figure 19 ■ Cleavage of SNAP-25 by CPNv(Ek)-A 

30 

Primary cultures of dorsal root ganglia (DRG) were exposed to varying 
■ ■ concentrations »f.GPNv(Ek)rA f0K-24vhour§.- CeUularcproteins- were separated ,by 
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SDS-PAGE. Western blotted, and probed with antl-SNAP-25 to facilitate an 
assessment of SNAP-25 cleavage. The percentage of cleaved SNAP-25 was 
calculated by densitometric analysis. CPNv-A as prepared in Example 26 was used 
foi^comparison purposes: The-percentage cleavage of SNAP-25 by CPNv{Ek)-A 
5 (labelled as En activated) and CPNv-A (labelled as Xa activated) are illustrated. 

Figure 20 - Expressed / purified CPNv-<; product 

Proteins were subjected to SDS-PAGE prior to staining with Coomassie Blue. The 
10 electrophoresis profile indicates purification of a disulphide-bonded di-chain species 
of the expected molecular mass of CPNv-C. Lane 1 = benchmark molecular mass 
markers; Lane 2 = total E. coli protein soluble fraction; Lane 3 = purified material 
following initial capture on Ni^*-charged Sepharose; Lane 4 = Factor Xa treated 
material prior to final capture on Nl^*-charged Sepharose; Lane 5 = purified material 
15 following second capture on Ni^*-charged Sepharose; Lane 6 = final purified 
material; Lane 7 = final purified material + DTT; Lane 8 = benchmari^ molecular 
mass markers. 

Figure 21 - Cleavage of syntaxin by CPNv-C 

20 

Primary cultures of dorsal root ganglia (DRG) were exposed to varying 
concentrations of CPNv-C for 24 hours. Cellular proteins were separated by SDS- 
PAGE, Western blotted, and probed with anti-syntaxin to facilitate an assessment of 
syntaxin cleavage. The percentage of cleaved syntaxin was calculated by 
25 densitometric analysis. The fusion concentration required to achieve 50% maximal 
syntaxin cleavage is estimated to be 3.1 3±1 .96 nM. 

Figure 22 - CPN-A efficacy in tiie Acute Capsaicin-lnduced Mechanical 
Allodynia model 

30 

The ability of an LC/A-nociceptin-HN/A fusion (CPN/A) to inhibit capsaicin-induced 
« vmechanicaliairodiajia was'Qvaluatecfcfeftowingsa in 
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the rat hind paw. Test animals were evaluated for paw withdrawal frequency 
(PWF%) in response to a 10 g Von Frey filament stimulus series (10 stimuli x 3 
trials) prior to recmitment into the study (Pre-Treat); after subcutaneous intraplantar 
-treatment with CRN/A but before capsaicin (Pre-CAP); and following capsaicin 
5 challenge post-injection of CPN/A (average of responses at 15' and 30'; CAP). 
Capsaicin challenge was achieved by injection of 1 0 |jL of a 0.3% solution. Sample 
dilutions were prepared in 0.5% BSA/sallne. 

Figure 23 - CPN-A efficacy in the Streptozotocin (STZ)-lnduced Peripheral 
10 Diabetic Neuropathy (Neuropathic Pain) model 

Male Sprague-Dawley rats (250-300 g) are treated with 65 mg/kg STZ In citrate 
buffer (I.V.) and blood glucose and lipid are measured weekly to define the 
readiness of the model. Paw Withdrawal Threshold (PWT) Is measured In response 

15 -to -a Von Frey filament-stimulus-serlesT)ver a period of time. Allodynia Is said to be 
established when the PWT on two consecutive test dates (separated by 1 week) 
measures below 6 g on the scale. At this point, rats are randomized to either a saline 
group (negative efficacy control), gabapentin group (positive efficacy control) or a 
test group (CPN/A). Test materials (20-25 pi) are Injected subcutaneously as a 

20 single injection (except gabapentin) and the PWT is measured at 1 day post- 
treatment and periodically thereafter over a 2 week period. Gabapentin (30 mg/kg 
i.p. @ 3 ml/kg injection volume) is Injected daily, 2 hours prior to the start of PWT 
testing. 

25 Figure 24 - CPNv-A efficacy in the Acute Capsaicin-lnduced IMechanical 
Allodynia model 

The ability of an LC/A-noclceptin variant-HN/A fusion (CPNv/A) to inhibit capsaicin- 
induced mechanical allodynia was evaluated following subcutaneous intraplantar 
30 injection in the rat hind paw. Test animals were evaluated for paw withdrawal 
frequency (PWF%) in response to a 10 g Von Frey filament stimulus series (10 
... . .-,:rr-^ . . ic, ."=i,itj^fti]iii«.34iiats)*pri§F^I^^^ (Pre-Treat); after, subcutaneous,, , x*,^:^ WW"^ 
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intraplantar treatment with CPNv/A but before capsaicin (Pre-CAP), and following 
capsaicin challenge post-injection of CPNv/A (average of responses at 15' and 30'; 
CAP). Capsaicin challenge was achieved by injection of 10 pL of a 0.3% solution. 
- Sample-dllutions-weFe prepared in &.€% BSA/saline. These data are expressed as a 
5 nonmalized paw withdrawal frequency differential, in which the difference between 
the peak response (post-capsaicin) and the baseline response (pre-capsaicin) is 
expressed as a percentage. With this analysis, it can be seen that CPNv/A is more 
potent than CPN/A since a lower dose of CPNv/A is required to achieve similar 
analgesic effect to that seen with CPN/A. 

10 

Figure 25 - Expressed / purified LC/A-CPLE-Hn/A product 

Proteins were subjected to SDS-PAGE prior to staining with Coomassie Blue. The 
electrophoresis profile indicates purification of a disulphlde-bonded dl-chain species 

15 of the expected molecular mass of CPLE^A. Lane 1 = benchmark molecular mass 
markers; Lane 2 = total £. coli protein soluble fraction; Lane 3 = purified material 
following initial capture on Ni^*-charged Sepharose; Lane 4 = Factor Xa treated 
material prior to final capture on Ni^*-charged Sepharose; Lane 5 = purified material 
following second capture on Ni^*-charged Sepharose; Lane 6 = final purified 

20 material; Lane 7 = final purified material + DTT. 

Figure 26 - Expressed / purified LC/A-CPBE-Hn/A product 

Proteins were subjected to SDS-PAGE prior to staining with Coomassie Blue. The 
25 electrophoresis profile indicates purification of a disulphide-bonded di-chain species 
of the expected molecular mass of CPBE-A. Lane 1 = total E coli protein soluble 
fraction; Lane 2 = purified material following Initial capture on NP-charged 
Sepharose; Lane 3 = Factor Xa treated material prior to final capture on Nl^*- 
charged Sepharose; Lane 4 = purified final material post activation with Factor Xa (5 
30 pi); Lane 5 = purified final material post activation with Factor Xa (1 0 pi); Lane 6 = 
purified final material post activation with Factor Xa (20 pi); Lane 7 = purified final 
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post activation with Factor Xa + DTT (10 ijI); Lane 9 = purified final material post 
activation with Factor Xa + DTT (20 |jl); Lane 10 = benchmark molecular mass 
markers. 

s Figure 27 - Expressed / purified CPOP-A product 

Proteins were subjected to SDS-PAGE prior to staining with Coomassie Blue. The 
electrophoresis profile indicates purification of a disulphide-bonded di-chain species 
of the expected molecular mass of CPOP-A. Lane 1 = benchmark molecular mass 

10 markers; Lane 2 = purified material following initial capture on Ni^*-charged 
Sepharose; Lane 3 = Factor Xa treated material prior to final capture on Ni^*- 
charged Sepharose; Lane 4 = purified material following second capture on Ni^*- 
charged Sepharose; Lane 5 = purified final material post activatfon with Factor Xa (6 
pi); Lane 6 = purified final material post activation with Factor Xa (10 pi); Lane 7 = 

15 purified final material post-activation with Factor Xa (20 pi); Lane 8 = purified final 
material post activation with Factor Xa + DTT (5 pi); Lane 9 = purified final material 
post activation with Factor Xa + DTT (10 pi); Lane 10 = purified final material post 
activation with Factor Xa + DTT (20 pi). 

20 Figure 28 - Expressed / purified CPOPv-A product 

Proteins were subjected to SDS-PAGE prior to staining with Coomassie Blue. The 
electrophoresis profile indicates purification of a disulphide-bonded di-chain species 
of the expected molecular mass of CPOPv-A. Lane 1 = benchmark molecular mass 

25 markers; Lane 2 = total E. coli protein soluble fraction; Lane 3 = purified material 
following initial capture on Ni^*-charged Sepharose; Lane 4 = Factor Xa treated 
material prior to final capture on Ni^*-charged Sepharose; Lane 5 = purified final 
. material post activation with Factor Xa (5 pi); Lane 6 = purified final material post 
activation with Factor Xa (10 pi); Lane 7 = purified final material post activation with 

30 Factor Xa (20 pi); Lane 8 = purified final material post activation with Factor Xa + 
DTT (5 pi); Lane 9 = purified final material post activation with Factor Xa + DTT (1 0 
Vfx o";**;." >5^J)f;CShe^^^ ='t-lpflrifiea'fiRal'fnaterial post«GAK'atiOT% With Fadtcrif;Xsti+ DTT (2X):yl)..j -"T^g-^ Fv-V-^ 
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Figure 29 - In vitro SNAP-25 cleavage in a DRG cell model 

5 Primary cultures of dorsal root ganglia (DRG) were exposed to varying 
concentrations of CPOPv-A for 24 hours. Cellular proteins were separated by SDS- 
PAGE, Western blotted, and probed with anti-SNAP-25 to facilitate an assessment 
of SNAP-25 cleavage. The percentage of cleaved SNAP-25 was calculated by 
densitometric analysis. 

10 

Figure 30 - Expressed / purified CPNv-A-FXa-HT (removable his-tag) 

Proteins were subjected to SDS-PAGE prior to staining with Coomassie Blue. The 
electrophoresis profile indicates purification of a disulphide-bonded di-chain species 
of the expected- molecular mass of CPNv-A-FXa-HT. Lane t = benchmari< 
molecular mass markers; Lane 2 = total E. coli protein soluble fraction; Lane 3 = 
Factor Xa treated material prior to final capture on Ni^*-charged Sepharose; Lane 4 
= purified final material post activation with Factor Xa; Lane 5 = purified final material 
post activation with Factor Xa + DTT. 

Figure 31 - In vitro efficacy of LC/A-nociceptin-HN/A fusion proteins with 
variable spacer length, as assessed by ligand competition assay 

The ability of LC/A-nociceptin-HN/A fusions of variable spacer length to bind to the 
25 ORLi receptor was assessed using a simple competition-based assay. Primary 
cultures of doreal root ganglia (DRG) were exposed to varying concentrations of test 
material in the presence of 1 nM [3H]-nociceptin. The reduction In specific binding of 
the radiolabelled ligand was assessed by scintillation counting, and plotted in 
comparison to the efficacy of unlabelled ligand (Tocris nociceptin). The upper panel 
30 illustrates the displacement characteristics of the GSO, GS20, GS30 and Hx27 
spacers, whilst the lower panel illustrates the displacement achieved by the GS 1 0, 
- - .V K • «>^* ";c -GST5'^nd '@S25 spaced fusion pifoteiFrs; 'rlt;4§-' (^eluded thettitha GSO 9nd-GS3.Q r -:7"V*i 



15 



20 
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spacers are ineffective, and the GS10 is poorly effective, at displacing nociceptin 
from the 0RL1 receptor. 



Figure 32 - In vitro efficacy of LC/A-nociceptin-HN/A fusion proteins witli 
5 variable spacer length, as assessed by in vitro SNAP-25 cleavage 



Primary cultures of dorsal root ganglia (DRG) were exposed to varying 
concentrations of CPN-A (of variable spacer length) for 24 hours. Cellular proteins 
were separated by SDS-PAGE, Western blotted, and probed with anti-SNAP-25 to 

10 facilitate an assessment of SNAP-25 cleavage. The percentage of cleaved SNAP- 
25 was calculated by densitometric analysis. The poorly effective binding 
characteristics of the GS1 0 spaced fusion protein (see Figure 28) are reflected in the 
higher concentrations of fusion required to achieve cleavage of intracellular SNAP- 
25. GSO and GS30 spaced fusion proteins were completely ineffective (date not 

15 -shown). GS15, 20 and 25-spaced fusion proteins were similarly effective. 



SEQ ID Nos 

SEQ ID1 DMA sequence of N[1-17] 

20 SEQ I D2 Protein Sequence of N[1 -17] 

SEQ IDS. DMA sequence of N[1-1 1] 

S EQ I D4 Protein sequence of N [1 -1 1 ] 

SEQ IDS DMA sequence of N[[Y10]1-1 1] 

SEQ ID6 Protein sequence of N[[Y10]1-1 1] 

25 SEQ ID7 DMA sequence of N[[Y1 111-1 1] 

SEQ IDS Protein sequence of N[[Y1 IJI-1 1] 

SEQ ID9 DNA sequence of N[[Y1411-17] 

. SEQID10 Protein sequence of N[IY14]1 -17] 

SEQ ID1 1 DNA sequence of N[1-13] 

30 SEQ ID12 Protein sequence of N[1-1 3] 

SEQ ID13 DNA sequence of Nv (also known as N[[R14K15]1-17]) 
k \7 >SEQ ID44V sPTOtiairi' seqaei^<3©.'6{^r^'oirfc^ 5]ri -1 7]) ;^ . %.f . . - f,i^. 
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SEQ ID15 DNA sequence of N[1-1 7]-LHn/A fusion protein 

SEQ ID16 Protein sequence of N[1-1 7]-LHn/A fusion protein 

SEQ ID17 DNA sequence of N[[Y1 1]1-1 1]-LHN/A fusion protein 

SEQ ID18 Protein sequence of N[[Y1 1]1-1 1]-LHN/A fusion protein 

5 SEQ ID1 9 DNA sequence of N[1 -1 3]-LHN/A fusion protein 

SEQ ID20 Protein sequence of N[1 -1 3]-LHN/A fusion protein 

SEQ ID21 DNA sequence of LHN/A-N[1-171 fusion protein 

SEQ ID22 Protein sequence of LHN/A-N[1 -1 7] fusion protein 

SEQ ID23 DNA sequence of LHN/C-N[1-1 1] fusion protein 

10 SEQ ID24 Protein sequence of LHN/C-N[1 -1 1 ] fusion protein 

SEQ ID25 DNA sequence of N[[Y14]1-17]-LHN/C fusion protein 

SEQ ID26 Protein sequence of N[[Y14]1-1 71-LHN/C fusion protein 

SEQ iD27 DNA sequence of tlie LC/A 

SEQ ID28 DNA sequence of the Hn/A 

15 SEQID29 DNAsequenceoftheLC/B 

SEQ ID30 DNA sequence of tlie Hn/B 

SEQ ID31 DNA sequence of the LC/C 

.SEQID32 DNA sequence of the Hn/C 

SEQ ID33 DNA sequence of the CPN-A linker 

20 SEQ ID34 DNA sequence of the A linker 

SEQ ID35 DNA sequence of the N-temiinal presentation noclceptin insert 

SEQ ID36 DNA sequence of the CPN-C linker 

SEQID37 DNA sequence of the CPBE-A linker 

SEQ ID38 DNA sequence of the CPNvar-A linker 

25. SEQID39 DNA sequence of the LC/A-CPN-Hn/A fusion 

SEQ ID40 Protein sequence of the LC/A-CPN-Hn/A fusion 

SEQ ID41 DNA sequence of the N-LC/A-Hn/A fusion 

. SEQID42 Protein sequence of the N-LC/A-Hn/A fusion 

SEQ ID43 DNA sequence of the LC/C-CPN-Hn/C fusion 

30 SEQ 1044 Protein sequence of the LC/C-CPN-Hm/C fusion 

SEQ ID45 DNA sequence of the LC/C-CPN-Hn/C (A-linker) fusion 
^..:-«r.> T:/aEQill^.r.8rot0iFfjssquent»iaftl^^ . . . » 
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SEQ ID47 DNA sequence of the LC/A-CPME-Hn/A fusion 

SEQ I D48 Protein sequence of tlie LC/A-CPI\/1E-Hn/A fusion 

SEQ ID49 DNA sequence of the LC/A-CPBE-Hn/A fusion 

SEQ IO50- -Protein-sequenee^f-theH=G/A-GPBE-HN/A-fusion 

5 SEQ ID51 DNA sequence of the LC/A-CPNv-Hn/A fusion 

SEQ ID52 Protein sequence of the LC/A-CPNv-Hn/A fusion 

SEQ ID53 DNA sequence of the LC/A-CPN[1 -1 11-HN/A fusion 

SEQ ID54 Protein sequence of the LC/A-CPN[1-1 1]-HN/A fusion 

SEQ iD55 DNA sequence of the LC/A-CPN[[Y10]1 -1 1]-HN/A fusion 

10 SEQ ID56 Protein sequence of the LC/A-CPN[[Y1 0]1-1 1]-HN/A fusion 

SEQ ID57 DNA sequence of the LC/A-CPN[[Y1 1]1-1 1]-HN/A fusion 

SEQ ID58 Protein sequence of the LC/A-CPN[[Y1 111-1 1]-HN/A fusion 

SEQ ID59 DNA sequence of the LC/A-CPNIIYUJI-ITl-HN/A fusion 

SEQ ID60 Protein sequence of the LC/A-CPN[[Y14]1-1 7]-HN/A fusion 

15 SEQ ID61 DNA sequence of the LC/A-CPN[1-13]-HN/A fusion 

SEQ 1062 Protein sequence of the LC/A- CPN[1-13]-HN/A fusion 

SEQ I D63 DNA sequence of the nociceptin-spacer-LC/A-HN/A fusion 

SEQ ID64 Protein sequence of the nociceptin-spacer-LC/A-HN/A fusion 

SEQ ID65 DNA sequence of the CPN-A GS1 0 linker 

20 SEQ ID66 DNA sequence of the CPN-A GS1 5 linker 

SEQ .ID67 DNA sequence of the CPN-A GS25 linker 

SEQ I D68 DNA sequence of the CPN-A GS30 linker 

SEQID69 DNA sequence of the CPN-A HX27 linl<er 

SEQ ID70 DNA sequence of the LC/A-CPN(GS1 5)-Hn/A fusion 

25 SEQID71 Protein sequence of the LC/A-CPN(GS15)-Hn/A fusion 

SEQID72 DNA sequence of the LC/A-CPN(GS25)-Hn/A fusion 

SEQ ID73 Protein sequence of the LC/A-CPN(GS25)-Hn/A fusion 

SEQ ID74 DNA sequence of the CPNvar-A Enterokinase activatabie linker 

SEQID75 DNA sequence of the LC/A-CPNv(Ek)-HN/A fusion 

30 . SEQ ID76 Protein sequence of the LC/A-CPNv(Ek)-HN/A fusion 

SEQ ID77 DNA sequence of the CPNvar-A linkei- 
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SEQ ID79 Protein sequence of the LC/C-CPNv-Hn/C fusion (act. A) 

SEQ ID80 DNA sequence of the LC/A-CPLE-Hn/A fusion 

SEQ ID81 Protein sequence of the LC/A-CPLE-Hn/A fusion 

SEQ-ID82 DNA sequence of the LC/A-CPOP-Hn/A fusion 

5 SEQ ID83 Protein sequence of the LC/A-CPOP-Hn/A fusion 

SEQ ID84 DNA sequence of the LC/A-CPOPv-Hn/A fusion 

SEQ ID86 Protein sequence of the LC/A-CPOPv-Hn/A fusion 

SEQ ID86 DNA sequence of the IgA protease 

SEQ ID87 DNA sequence of the IgA-CPNv-HN/A fusion 

10 SEQ ID88 Protein sequence of the IqA-CPNv-Hn/A fusion 

SEQ ID89 DNA sequence of the FXa-HT 

SEQ ID90 DNA sequence of the CPNv-A-FXa-HT 

SEQ ID91 Protein sequence of the CPNv-A-FXa-HT fusion 

SEQ ID92 DNA sequence of the DT translocation domain 

15 SEQ ID93 DNA sequence of the CPLE-DT-A 

SEQ ID94 Protein sequence of the CPLE-DT-A fusion 

SEQ ID95 DNA sequence of the TeNT LC 

SEQ ID96 DNA sequence of the CPNv-TENT LC 

SEQ ID97 Protein sequence of the CPNV-TeNT LC fusion 

20 SEQ ID98 DNA sequence of the CPNvar-C linker 

SEQ ID99 DNA sequence of the LC/C-CPNv-Hn/C fusion (act. C) 

SEQ ID1 00 Protein sequence of the LC/C-CPNv-Hn/C fusion (act. 0) 



25 Examples 

Example 1 - Confirmation of TM Agonist Activity by measuring release of 
substance P from neuronal cell cultures 

30 Materials 

Substance P ElA Is obtained from R&D Systems, UK. 
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Methods 

Primary neuronal cultures of eDRG are established as described previously (Duggan 
etaL, 2002). Substance P release from the cultures Is assessed by EIA, essentially 
-as-deseribed previously (Duggan etal., 2002)-. ^he TM of Interest is added to the 

5 neuronal cultures (established for at least 2 weeks prior to treatment); control 
cultures are perfomied in parallel by addition of vehicle In place of TM. Stimulated 
(100 mM KCI) and basal release, together with total cell lysate content, of 
substance P are obtained for both control and TM treated cultures. Substance P 
Immunoreactivity is measured using Substance P Enzyme Immunoassay Kits 

10 (Cayman Chemical Company, USA or R&D Systems, UK) according to 
manufacturers' instructions. 

The amount of Substance P released by the neuronal cells In the presence of the 
TM of Interest Is compared to the release obtained In the presence and absence of 
15 -100 mM KCI. Stimulation of Substance P release by the TM of Interest above the 
basal release, establishes that the TM of interest is an "agonist ligand" as defined in 
this specification. If desired the stimulation of Substance P release by the TM of 
Interest can be compared to a standard Substance P release-curve produced using 
the natural ORL-1 receptor ligand, noclceptin (Tocrls). 

20 

Example 2 - Expression and purification of catalyticaliy active LHn/A 

Materials 

Synthetic DNA obtained from Sigma Genosys. 
25 Restriction enzymes obtained from New England Blolabs. 

Methods 

. The expression and purification of catalyticaliy active LHn/A was earned out 
essentially as described in Sutton et al., (2005), Prot. Express. Purif., 40, pp 31-41 . 

30 

Briefly, DNA encoding the light chain plus 423 amino acids from the N-terminal of the 
• n.^^.'.- « -V beavytchain of BdNT/A.was synthesjsed by SlgrDa^Qenosys to prqdwcg q .synthetic ■ > 
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LHn/A gene with an E. co// codon bias. Tlie llnl<er region between the light chain and 
Hn domain was engineered to contain a Factor Xa cleavage site by splice-overlap 
extension PGR. Two PGR products were generated using primer pairs consisting of 
^-iongi mutagenic p^ime^a^d a shorter, non-mutagenic primer: 

5 

(5-tccaaaactaaatctctgATAGAAGGTAGAaacaaagcgctgaacgac)with 
(5'-CTTGATGTACTCTGTGAACGTGCTC); and 

(5'-gtcgttcagcgctttgttTCTACCTTCTATcagagatttagttttgga)with 
10 (5'-ATGGAGTTCGTTAACAAACAGTTC). 

The products firom these two reactions were used as templates for the splice-overlap 
extension PGR. A further PGR reaction was set up to add fiamHI and Sa/I sites at 
either end of the activatable recLHw/A gene and these sites were used for insertion 
15 into an Invitrogen gateway entry vector. The entry vector was then used, along with 
a gateway recombination site adapted pMAL c2x, in a LR clonase reaction to fomn 
pMAL c2x recLHN/A. The pMAL c2x recLHN/A was modified to incorporate a 6'HIS 
tag at the N-tenminus of the MBP. This was achieved by the insertion of annealed 
oligonucleotides encoding the HIS tag into the A/del site of pMAL. 

20 

The expression vector e)?pressing LHn/A was transfonned Into E. coli HMS174 or 
AD494(DE3) (Novagen). Gultures were grown in Tenrific broth complex medium 
supplemented with ZnCb (1 |JM), ampicillin (100 |jg/ml), 0.2% (w/v) glucose. 
Parameters for expression of all the constructs were initially determined in shake 
25 flask cultures before transferring into 8 L fermentor systems. Starter cultures were 
grown for 1 6 hours at ST'G, 220 rpm and used to inoculate 1 L in which growth was 
continued at 37°C, 250 rpm. At an OD600 nm of 0.6 the temperature was reduced 
to 25X for 30 minutes before induction with 1 mM IPTG. Induction was continued 
for 4 hours before the cells were harvested and stored at -70°G. 

30 

Typically 16 g of cell paste was suspended in 160 ml PBS and lysed by sonication 
« . ... ir-(.MSEvSoniprep^1SG) The ^sultiRg;-lys@te"was\clarified'by«centrifugation prior f C^? . 
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loading onto a 25 ml amylose column and eluted with 10 mM maltose in PBS, The 
eluant contained approx. 50% pure fusion protein and was treated with Factor Xa 
(1 unit Factor Xa /1 00 |jg fusion protein; 20 hours; 26°C) to remove the HISMBP and 
cleavfr the ItG-Hn junctlon-to activate the protein-. - After incubation the sample was 

5 filtered (0.45 mm) and diluted two fold with water to give a 0.5 x PBS buffer 
composition. The cleaved, filtered and diluted recLHN/A was processed through a Q 
Sepharose FF column (10 ml) and eluted with a step gradient of 80 mM NaCI 
containing HISMBP and 120 mM NaCI containing approx. 75% pure recLHN/A. The 
addition of His tag to MBP overcame previous co-elution problems with LHn/A and 

10 MBP. As a final polishing step to ensure complete removal of the HISMBP, the 
120 mM NaCI elution from the Q Sepharose column was passed through a Nickel 
charged 5 ml HIsTrap column (Amersham). The flow through from the HisTrap 
column contained approx. 95% pure recLHN/A (see the Figures in Sutton ef a/., 
(2005), Prot. Express. Purif., 40, pp 31-41 for an illustration of the purification 

15 scheme for LHN/A). - - - 

Example 3 - Expression and purification of catalytically active recombinant 
LHn/B 

20 The methodology described below vwll purify catalytically active LHn/B protease from 
£. CO// transfonned with the appropriate plasmid encoding the LHn/B polypeptide. It 
should be noted that various sequences of suiteible LHn/B polypeptides have been 
described in PCT/GB97/02273, granted US 6 461617 and US patent application 
10/241596, incorporated herein by reference. 

25 

Methods 

The coding region for LHn/B is inserted in-frame to the 3' of the gene encoding 
maltose binding protein (MBP) in the expression vector pMAL (New England 
Biolabs) to create pMAL- c2x-LHn/B. In this construct, the expressed MBP and 
LHn/B polypeptides are separated by a Factor Xa cleavage site, and the LC and Hn 
domains are separated by a peptide that is susceptible to cleavage with 
T f/si r. 4^ .;rt^-eRterokiBase^3lh¥eexprefi»fon<«lo»8its ten^^ 
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pMAL-c2X-synLHN/B is transformed into E. coli HMS174 and cultured in Temfic 
broth complex medium in 8 L fermentor systems. Pre-induction bacterial growth is 
• maintained at 37°G to an-0D6OO nm of 5;0, at which stage expression of recMBP- 
LHn/B is induced by addition of IPTG to 0.5 mM and a reduction in temperature to 
30°C. After four hours at 30°C the bacteria are harvested by centrifugation and the 
resulting paste stored at -70°C. 

The cell paste is resuspended in 20 mM Hopes pH 7.2, 125 mM NaCI, 1 pM ZnCh 
and cell disruption achieved using an APV-Gaulin lab model 1000 homogeniseror a 
MSE Soniprep 1 50 sonicator. The resulting suspension is clarified by centrifugation 
prior to purification. 

Following cell disruption, the MBP-fusion protein is captured either on an amylose 
affinity resin in 20 mM Hepes pH -7.2, 125 mM NaCI, 1 pM ZnCb, or on a Q- 
Sepharose FF anion-exchange resin in 50 mM Hepes pH 7.2, 1 pM ZnCIa with no 
salt. A single peak is eluted from the amylose resin in the same buffer plus 10 mM 
maltose and from the Q-Sepharose in 150-200 mM salt. Cleavage of the MBP- 
LHn/B junction is completed in an 18 hours incubation step at 22"'C with Factor Xa 
(NEB) at 1 U/50 pg fusion protein. A substrate (MBP-LHn/B) concentration of at 
least 4 mg/ml is desirable for efficient cleavage to take place. 

The cleaved protein is diluted with 20 mM Hepes to a buffer composition of 20 mM 
Hepes, 25 mM NaCI, 1 pM ZnCb, pH 7.2 and processed through a Q Sepharose 
column to separate the MBP from LHn/B. The LHn/B is eluted from the Q- 
Sephanase column with 120-170 mM salt. The linker between the light chain and Hn 
domain is tiien nicked by incubation with enterokinase at 1 U/100 pg of LHn/B at 
22''C for 1 6 hours. Finally, the enterokinase is separated from the nicked LHn/B and 
other contaminating proteins on a Benzamidine Sepharose column, the enzyme 
preferentially binding to the resin over an incubation of 30 minutes at 4''C. Purified 
LHn/B is stored at -20°C until required. See Figure 1 for an illustration of the 
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Example 4 - Expression and purification of catalytically active recombinant 
LHn/C 

The coding region for LHn/C is Inserted In-frame to the 3' of the gene encoding 
maltose binding protein (MBP) In the expression vector pMAL (New England 
Biolabs) to create pIVIAL- c2x-LHn/C. In this construct the expressed IVIBP and 
LHn/C polypeptides are separated by a Factor Xa cleavage site. 

pMAL-c2x-LHn/C is transformed Into £. coli AD494 (DE3, IRL) and cultured in 
Teniflc broth complex medium In 8 L femientor systems. Pre-lnduction bacterial 
growth are maintained at 30°C to an OD600 nm of 8.0, at which stage expression of 
recl\/IBP-c2x-LHN/C is induced by addition of IPTG to 0.5 mM and a reduction in 
temperature of culture to 25"'C. After 4 hours at 25°C the bacteria are harvested by 
centrifugation and the resulting paste stored at -yO'C. 

The cell paste Is resuspended In 50 mM Hepes pH 7.2. 1 pM ZnCb at 1 :6 (w/v) and 
cell disruption is achieved using an APV-Gaulln lab model 1000 homogenlser or a 
MSE Sonlprep 150 sonicator. The resulting suspension is clarified by centrifugation 
prior to purification. 

5 

Following cell disruption and clarification, the MBP-fuslon protein Is separated on a 
Q-Sepharose Fast Flow anion-exchange resin in 50 mM Hepes pH 7.2, 1 pM ZnCIa 
and eluted with the same buffer plus 100 mM NaCI. A double point cleavage is 
performed at the MBP-LHn/C junction and the Hn-LC linker In a single incubation 
10 step with Factor Xa. The reaction is completed in a 1 6-hour incubation step at 22°C 
with Factor Xa (NEB) at 1 U/lbO Ig fusion protein. The cleaved protein Is diluted 
with 20 mM Hepes to a buffer composition of 20 mM Hepes, 25 mM NaCI, pH 7.2 
and processed through a second Q-Sepharose column to separate the MBP from 
LHn/C. Activated (disulphide-bonded cleaved linker) LHn/C Is eluted from the Q- 
15 Sepharose column by a salt gradient (20 mM Hepes, 500 mM NaCI, 1 pM ZnCb, 
... .a: ^.^.ixr'm' -^yfi'P f^ Figure 2 for an Illustration of , the purlficatipn>of -^^-^^^^^^ 
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LHn/C. 



Example 5 - Production of a chemical conjugate of nociceptin and LHn/A 



5 Materials 

C-temiinally extended nociceptin peptide obtained from Sigma Genosys. 
Conjugation cliemicals obtained from Pierce. 



Methods 

10 In order to couple tiie nociceptin peptide via a C-tenminal Cys, the peptide was first 
syntliesised (by standard procedures, commercially obtainable) to include a Cys as 
the final C-termlnal amino acid. 

This peptide was then used as the second component in a sulphydryl based 
15 coupling reaction as described below (see also previous publications WO 99/1 7806 
and WO 96/33273 and Duggan etal., (2002), J. Biol. Chem. 277, 24846-34852 and 
Chaddock et al., (2000), Infect Immun., 68, 2587-2593). 

Sulphydryl hased coupling reaction 
20 Briefly, approximately two reactive leaving groups were introduced into LHn/A 
(5mg/ml In phosphate-buffered saline) by reaction with A/-succinimidyl-3-(2- 
pyridyldithio)propionate (SPDP). 

Derivatised material was isolated from excess SPDP by size exclusion 
25 chromatography. Reconstituted cysteine-tagged nociceptin ligand was mixed with 
the derivatised LHn/A in a 4:1 molar ratio, and incubated at room temperature for 
1 hour with gentle agitation in order to create a chemical conjugate through a 
reducible covalent disulphide bond. Initial fractionation of the conjugate mixture to 
remove unconjugated peptide was perfonned by size exclusion chromatography 
30 (Superose-1 2, or Superdex G-200 depending on scale of conjugation). 
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Example 6 • Production of a chemical conjugate of nociceptin and LHn/B 

Materials 

5 C-temiinally extended nociceptin peptide obtained from Sigma Genosys. 
Conjugation cliemicals obtained from Pierce. 

Methods 

Lyopiiilised nociceptin was dissolved by tine addition of water and dialysed into MES 
10 buffer (0.1 IVI MES, 0.1 IVI NaCI, pH 5.0). To this solution (at a concentration of about 
0.3 mg/ml) was added PDPH (100 mg/ml in DMF) to a final concentration of 
1 mg/ml. After mixing, solid EDAC was added to produce a final concentration of 
about 0.2 mg/ml. The reaction was allowed to proceed for at least 30 minutes at 
room temperature. Excess PDPH was then removed by desalting over a PD-10 
15 column (Pharmacia) previously equilibrated with MES buffer. 

An amount of LHn/B equivalent to half the weight of nociceptin used dissolved in 
triethanolamine buffer (0.02 M triethanolamine/HCI, 0.1 M sodium chloride, pH 7.8) 
at a concentration of about 1 mg/ml, was reacted with Trauf s reagent (1 00 mM stock 
20 solution in 1 M triethanolamine/HCI, pH 8.0) at a final concentration of 2 mM. After 

1 hour, the LHn/B was desalted into PBSE (phosphate buffered saline with 1 mM 
EDTA) using a PD-10 column (Phannacia). The protein peak from the column 
eluate was concentrated using a Microcon 50 (Amicon) to a concentration of about 

2 mg/ml. 

25 

The derivatised nociceptin was subjected to a final concentration step resulting in a 
reduction in volume to less than 1 0% of the starting volume and then mixed with the 
derivatised LHn/B overnight at room temperature. Theproducts of the reaction were 
analysed by polyacrylamide gel electrophoresis in the presence of sodium dodecyl- 
30 sulphate (SDS-PAGE). 



> The conjugate resulting -from the above reaction was partially purified by size 
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exclusion chromatography over Bio-Gel P-100 (BioRad). The elution profile was 
followed by measuring the optical density at 280 nm and SDS-PAGE analysis of the 
fractions. This allowed the separation of conjugate from free nociceptin and by- 
products of the reaction. 

5 

Example 7 - Production of a chemical conjugate of nociceptin 1-11 and LHn/B 

Materials 

C-terminally extended nociceptin 1-11 peptide obtained from Sigma Genosys. 
10 Conjugation chemicals obtained from Pierce. 

Methods 

In order to couple the nociceptin 1-11 peptide via a C-temninal Cys, the peptide was 
first synthesised (by standard procedures, commercially obtainable) to Include a 
15 • Cys asihe final C-terminal amino ^icid . 

This peptide was then used as the second component in a sulphydryl based 
coupling reaction as described in Example 5. 

20 Example 8 - Production of a chemical conjugate of nociceptin N[[Y14]1-17] and 
LHn/C 

Materials 

C-terminally extended nociceptin N[[Y14]1-17] peptide obtained from Sigma 
25 Genosys. 

Conjugation chemicals obtained from Pierce. 

Metfiods 

In order to couple the peptide via a C-terminal Cys, the peptide was first synthesised 
30 (by standard procedures, commercially obtainable) to include a Cys as the final C- 
tenninal amino acid. 
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This peptide was then used as the secx>nd component in a sulphydryl based 
coupling reaction as described in Example 5. 

Example 9 - Recombinant production of a single polypeptide fusion of 
s nociceptin-LHN/A (SEQ ID15 and SEQ ID16) 

The DNA sequence for the nociceptin-LHrg/A was designed by bacl< translation of the 
LC/A, Hn/A. and nociceptin amino acid sequences. The complete ORF containing 
the nociceptin-LC/A-activation Igop-Hn/A sequence was assembled within standard 
10 DNA sequence manipulation software (EditSeq). The activation loop between the 
LC/A cysteine and the Hn/A cysteine (CVRGIITSKTKSLDKGYNKALNDLC) was 
modified to incorporate a Factor Xa protease recognition site. 

Restriction sites appropriate to facilitate cloning into the required expression vector 
15 (for example BamHI/Salh)Twere incorporated at the 5' and 3' ends respectively of the 
sequence maintaining the correct reading frame. The DNA sequence was screened 
(using software such as MapDraw, DNASTAR Inc.) for restriction enzyme cleavage 
sequences incorporated during the back translation. Any cleavage sequences that 
were found to be common to those required by the cloning system were removed 
20 manually from the proposed coding sequence ensuring common E. coli codon usage 
was maintained. E. coll codon usage was assessed by reference to software 
programs such as Graphical Codon Usage Analyser (Geneart), and the %GC 
content and codon usage ratio assessed by reference to published codon usage 
tables (for example GenBank Release 143, 13 September 2004). 

25 

This optimised DNA sequence containing the nociceptin-LC/A-activation loop-Hn/A 
open reading frame (ORF) was then commercially synthesized and provided in the 
pCR 4 vector. 

30 The DNA encoding the nociceptin-LHN/A fusion was isolated from pCR 4 and 
transfen-ed into pMAL vector backbone to facilitate protein expression. The resultant 
. .r - i-.n • i^M^kt^P^LWIWA yeptorvvygs tcansformed into»competent E. coli BL21 and correct . . w 
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transformants selected. A single colony of pMAL NO-LHn/A was grown in Terrific 
broth complex medium supplemented with ZnCl2 (1 mM), ampicillin (100 |jg/ml), 
0.2% (w/v) glucose. Expression of the Insert was induced by the addition of IPTG 
(0.1-mM)-and the culture maintained at IBX for 16 hours. After this period of 
5 expression the bacteria were Isolated by centrifugation and the cell pellet stored at 
-20X until use. 

10 g of £. CO// BL21 cell paste was defrosted in a falcon tube containing 25 ml 
50 mM HEPES, pH 7.2, 200 mlVI NaCI. The thawed cell paste was made up to 80 ml 
10 with 50 mM HEPES, pH 7.2. 200 mM NaCI and sonicated on ice 30 seconds on, 30 
seconds off for 10 cycles at a power of 22 microns ensuring the sample remained 
cool. The lysed cells were centrifuged at 18 000 rpm, 4°C for 30 minutes. The 
supernatant was loaded onto a 0.1 M NiS04 charged chelating cojumn (20-30 ml 
column is sufficient) and equilibrated with 50 mM HEPES, pH 7.2, 200 mM NaCI. 

15 

Using a step gradient of 1 0 and 40 mM imidazol, the non-specific bound protein was 
washed away and the fusion protein eluted with 1 00 mM imidazol. The eluted fusion 
protein was dialysed against 5 L of 50 mM HEPES, pH 7.2, 200 mM NaCI at 4''C 
overnight and the OD of the dialysed fusion protein measured. 1 unit of Factor Xa 
20 was added per 100 pg fusion protein and incubated at 25°C static overnight. The 
cleavage mixture was loaded onto a 0.1 M NISO4 charged Chelating column (20- 
30 ml column is sufficient) and equilibrated with 50 mM HEPES, pH 7.2, 200 mM 
NaCI. 

25 Using a step gradient of 1 0 and 40 mM imidazol, the non-specific bound protein was 
washed away and the fusion protein eluted with 1 00 mM Imidazol. The eluted fusion 
protein was dialysed against 5 L of 50 mM HEPES, pH 7.2, 200 mM NaCI at 4''C 
overnight and the fusion concentrated to about 2 mg/ml, aliquoted and stored at 
-20°C. 

30 

Figure 3 shows the SDS-PAGE analysis of expression and purification of N[1-17]- 
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Example 10 - Recombinant production of a single polypeptide fusion of 
(nociceptin 1-11)-LHn/B 

5 The DNA sequence for the (nociceptin 1-11)-LHn/B was designed by back 
translation of the LC/B, Hn/B, and nociceptin 1-11 amino acid sequences. The 
complete ORF containing the (nociceptini -1 1 )-LC/B-activation Ioop-Hn/B sequence 
was assembled within standard DNA sequence manipulation software (EditSeq). 
The activation loop between the LC/B cysteine and the Hn/B cysteine was modified 

10 to incorporate a Factor Xa protease recognition site. 

The recombinant fusion protein was then produced essentially as described in 
Example 9. 

IS Example i1~- Recombinant production of a single polypeptide fusion of 
(nociceptin N[[Y14]1-17]) -LHn/C (SEQ ID25 and SEQ ID26) 

The DNA sequence for the nociceptin N[[Y14]1-1 7] was designed by back translation 
of the LC/C, Hn/C, and nociceptin N[{Y14]1-17] amino acid sequences. The 
20 complete ORF containing the (nociceptin N[[Y14]1-1 7J)-LC/C-activation Ioop-Hn/C 
sequence was assembled within standard DNA sequence manipulation software 
(EditSeq). The activation loop between the LC/C cysteine and the Hn/C cysteine 
was modified to incorporate a Factor Xa protease recognition site. 

25 The recombinant fusion protein was then produced essentially as described in 
Example 9. 

Example 1 2 - Recombinant production of a single polypeptide fusion of LHn/C- 
30 (nociceptin 1-11) (SEQ ID23 and SEQ 1024) 

The DNA sequence for the LHN/C-(nociceptin 1-11) was designed by back 
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translation of the LC/C, Hn/C and noclceptin 1-11 amino acid sequences. The 
complete ORF (SEQ ID23) containing the LC/C-activation loop-HN/C-flexible spacer- 
(nociceptin 1-11) was assembled within standard DNA sequence manipulation 
software (EditSeq);- 

5 

The recombinant fusion protein (SEQ ID24) was then produced essentially as 
described In Example 9. 

Example 13 - Production of a conjugate for delivery of DNA encoding LC/C into 
10 a cell 

The construction of a noclceptln-HN-[LC/C] conjugate is described below, where 
[LC/C] represents the polylysine condensed DNA encoding the light chain of 
botullnum neurotoxin type C. 

15 

Materials 

SPDP is from Pierce Chemical Co. 

Additional reagents are obtained from Sigma Ltd. 

20 Methods 

Using a plasmid containing the gene encoding LC/C under the control of a CMV 
(Immediate early) promoter, condensation of DNA was achieved using SPDP- 
derivatised polylysine to a ratio of 2 DNA to 1 polylysine. Conjugates were then 
prepared by mixing condensed DNA (0.4 mg/ml) with HN-nociceptin (100 pg/ml)for 
25 1 6 h at 25''C. The SPDP-derivatised polylysine and the free -SH group present on 
the Hn domain combine to facilitate covalent attachment of the DNA and protein. 

Example 14 - Production of a conjugate for delivery of DNA encoding LC/B into 
a cell 

30 

The construction of a (nociceptin 1-11)-Hn-[LC/B] conjugate is described below, 
> wh'ere [LC/B] represents the polylysine condensed DNA encoding the light chain of 
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botulinum neurotoxin type B. 
Materials 

SPDP is from Pierce Chemical Co. 
s Additional reagents are obtained from Sigma Ltd. 

Methods 

Using a plasmid containing the gene encoding LC/B under the control of a CMV 
(immediate early) promoter, condensation of DNA was achieved using SPDP- 
10 derivatised polylysine to a ratio of 2 DNA to 1 polylysine. Conjugates were then 
prepared by mixing condensed DNA (0.4 mg/ml) with HN-(nociceptin 1-11) 
(100 |jg/ml) for 16 h at The SPDP-derivatised polylysine and the free -SH 
group present on the Hn domain combine to facilitate covalent attachment of the 
DNA and protein. 

15- 

Example 15 - Assessment of the activity of nociceptin-LHN/A in substance P 
releasing neuronal cells 

Using methodology described in Duggan ef a/., (2002, J. Biol. Chem., 277, 34846- 
20 34852), the activity of nociceptin-LHN/A in substance P releasing neuronal cells was 
assessed. 

Nociceptin-LHN/A fusion protein was applied to 2-week old dorsal root ganglia 
neuronal cultures, and incubated at 37X for 1 6 hours. Following the incubation, the 
25 media was removed and the ability of the cells to undergo stimulated release of 
substance P (SP) was assessed. 

The release of SP from the neuronal cells incubated with the nociceptin-LHN/A fusion 

protein was assayed in coniparison to (!) LHN/A-only treated cells and (ii) cells 
30 treated with media alone. This allowed the % inhibition of substance P from the 
eDRG to be calculated. The ability of the nociceptin-LHN/A fusion protein to inhibit 
L'^y.u ^SPreleasa (relatives celisst^at^^^ The ^ ^s^"" -^^ '^^ 
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data represent the mean of 3 determinations: 



Table 1 



-Test Material (mM) 


nociceptin-LHN/A fusion protein 


LHrj/A-only 


% Inhibition 


% inhibition 


1.0 


47.3 


25.6 


0.1 


13.8 


-11.5 



Example 16 - Confirmation of ORLi receptor activation by measuring 
forskolin-stimulated cAMP production 



Confimiation that a given TIVI is acting via the ORLi receptor is provided by the 
10 following test, in which the TMs ability to inhibit forskolin-stimulated cAMP production 
is assessed. 



15 



Materials 

[^H]adenine and [^"^CJt^MP are obtained from GE Healthcare 



Methods 

The test is conducted essentially as described previously by Meunier et al. [Isolation 
and structure of the endogenous agonist of opioid receptor-like ORLi receptor. 
Nature 377: 532-535, 1 995] in intact transfected-CHO cells plated on 24-well plastic 
20 plates. 

To the cells Is added [3H]adenine (1 .0 pCi) in 0.4 ml of culture medium. The cells 
remain at 37°C for 2 h to allow the adenine to incorporate into the intracellular ATP 
pool. After 2 h, the cells are washed once with incubation buffer containing: 1 30 mM 
25 NaCI, 4.8 mM KCI, 1 .2 mM KH2PO4, 1 .3 mM CaCh. 1 .2 mM MgS04, 10 mM glucose, 
1 mg/ml bovine semm albumin and 25 mM HEPES, pH 7.4, and replaced with buffer 
containing forskolin (10 pM) and isobutylmethylxanthine (50 pM) with or without the 
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TM of interest. After 10 min., the medium Is aspirated and replaced with 0.5 ml, 
0.2 M HCI. Approximately 1000 cpm of f "^CJcAMP is added to each well and used 
as an internal standard. The contents of the wells are then transferred to columns of 
-9.65 g dry-alumina powder. The columns are eluted with 4 ml of 5 mM HCI, 0.5 ml of 
5 0.1 M ammonium acetate, then two additional millilitres of ammonium acetate. The 
final eluate is collected into scintillation vials and counted for ^"^C and tritium. 
Amounts collected are corrected for recovery of [^*C]cAMP. TMs that are agonists 
at the ORLi receptor cause a reduction in the level of cAMP produced in response to 
forskolin. 

10 

Example 17 - Confirmation of ORLi receptor activation using a GlPyS 
binding functional assay 

Confirmation that a given TM is acting via the ORLi receptor is also provided by the 
IS following test, a GTPyS binding functional assay. 

Materials 

[35s]GTPyS is obtained from GE Healthcare 

Wheatgenn agglutlnin<»ated (SPA) beads are obtained from GE Healthcare 

20 

Methods, 

This assay is earned out essentially as described by Traynor and Nahorski 
[Modulation by p-opioid agonists of guanosine-5 -0-(3-pS]thio)triphosphate binding 
to membranes from human neuroblastoma SH-SY5Y cells. Mol. Pharmacol. 47: 848- 
25 854, 1995]. 

Cells are scraped from tissue culture dishes into 20 mM HEPES, 1 mM 
ethylenediaminetetraacetic acid, then centrifuged at 500 x gfor 10 min. Cells are 
resuspended in this buffer and homogenized with a Polytron Homogenizer. 

30 

The homogenate is centrifuged at 27,000 x gf for 15 min., and the pellet 
.. . ' . sr . .nes.usp,endQ^ iQ.l3uffetA, cQEjtaining: 20 mM HEPES» 10 mM MgCb, 1 oaraM NaCI, - . tm^-' 
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pH 7.4. The suspension Is recentrifuged at 20,000 ^ gand suspended once more In 
buffer A. For the binding assay, membranes (8-15 pg protein) are incubated with 
[35s]GTP S (50 pIVI), GDP (10 piVi), with and without the TM of interest, in a total 
-^0hame-of470-mlrfor 60nTiirh-at-25-Gr-Samples-are^ltered over glass fibre filters 
s and counted as described for the binding assays. 

Example 18 - Preparation of a LC/A and Hn/A backbone clones 

The following procedure creates the LC and Hn fragments for use as the component 
10 backbone for multidomain fusion expression. This example is based on preparation 
of a serotype A based clone (SEQ ID27 and SEQ ID28), though the procedures and 
methods are equally applicable to the other serotypes [illustrated by the sequence 
listing for serotype B (SEQ ID29 and SEQ ID30) and serotype C (SEQ ID31 and 
SEQ ID32)]. 

15 

Preparation of cloning and expression vectors 

pCR 4 (Invitrogen) is the chosen standard cloning vector, selected due to the lack of 
restriction sequences within the vector and adjacent sequencing primer sites for 
easy construct confirmation. The expression vector is based on the pMAL (NEB) 
20 expression vector, which has the desired restriction sequences within the multiple 
cloning site in the con'ect orientation for construct insertion (Sa/nHi-Sa/l-Psfl-H/ndlll). 
A fragment of the expression vector has been removed to create a non-mobilisable 
plasmid and a variety of different fusion tags have been inserted to increase 
purification options. 

25 . 

Preparation of protease (e.g. LC/A) insert 
The LC/A (SEQ ID27) is created by one of two ways: 
. The DNA sequence is designed by back translation of the LC/A amino acid 
sequence [obtained from freely available database sources such as GenBank 
30 (accession number PI 0845) or Swissprot (accession locus BXA1_CL0B0) using 
one of a variety of reverse translation software tools (for example EditSeq best E. 
.1,. - ' ^■i-"-<?r ■:.-yr-'r' »><aaj/r!aVteree;trattstetion:(DNASTARInC -fe? 
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BamHUSan recognition sequences are incorporated at the 5' and 3' ends 
respectively of the sequence, maintaining the correct reading frame. The DNA 
sequence is screened (using software such as MapDraw, DNASTAR Inc.) for 
restriction-enzyme cleavage -sequences incorporated during the back translation. 

5 Any cleavage sequences that are found to be common to those required by the 
cloning system are removed manually from the proposed coding sequence ensuring 
common E. coli codon usage is maintained. E. co// codon usage is assessed by 
reference to software programs such as Graphical Codon Usage Analyser 
(Geneart), and the %GC content and codon usage ratio assessed by reference to 

10 published codon usage tables (for example GenBank Release 143, 13 September 
2004). This optimised DNA sequence containing the LC/A open reading frame 
(ORF) Is then commercially synthesized (for example by Entelechon, Geneart or 
Sigma-Genosys) and is provided In the pCR 4 vector. 



15 -The alternative method is to use PCR amplification from an existing DNA sequence 
with BamHI and Sa/I restriction enzyme sequences incorporated into the 5* and 3' 
PCR primers respectively. Complementary oligonucleotide primers are chemically 
synthesised by a supplier (for example MWG or Sigma-Genosys), so that each pair 
has the ability to hybridize to the opposite strands (3' ends pointing "towards" each 

20 other) flanking the stretch of Clostridium target DNA, one oligonucleotide for each of 
the two DNA strands. To generate a PCR product the pair of short oligonucleotide 
primers specific for the Clostridium DNA sequence are mixed with the Clostridium 
DNA template and other reaction components and placed in a machine (the 'PCR 
machine') that can change the incubation temperature of the reaction tube 

25 automatically, cycling between approximately 94°C (for denaturation), 55''C (for 
oligonucleotide annealing), and 72°C (for synthesis). Other reagents required for 
amplification of a PCR product include a DNA polymerase (such as Taq or Pfu 
polymerase), each of the four nucleotide dNTP building blocks of DNA in equimolar 
amounts (50-200 pM) and a buffer appropriate for the enzyme optimised for Mg^* 

30 concentration (0.5-5 mM). 
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The amplification product is cloned into pCR 4 using either, TOPO TA cloning for 
Tag PCR products or Zero Blunt TOPO cloning for Pfu PCR products (both kits 
commercially available from Invitrogen). The resultant clone is checked by 
sequencing.' Any additional restriction sequences which are not-compatible with the 
s cloning system are then removed using site directed mutagenesis [for example, 
using Quickchange (Stratagene Inc.)]. 

Preparation of translocation (e.g. Hn) insert 

The Hn/A (SEQ ID28) is created by one of two ways: 

10 The DNA sequence is designed by back translation of the Hn/A amino acid 
sequence [obtained from freely available database sources such as GenBank 
(accession number P10845) or Swissprot (accession locus BXA1_CL0B0)] using 
one of a variety of reverse translation software tools [for example EditSeq best E. 
coli reverse translation (DNASTAR Inc.), or Backtranslation tool v2.0 (Entelechon)]. 

15 A PstI restriction sequence added to the N-temninus and Xbal-stop codon-H/ndl II to 
the C-temninus ensuring the correct reading frame is maintained. The DNA 
sequence is screened (using software such as MapDraw, DNASTAR Inc.) for 
restriction enzyme cleavage sequences incorporated during the back translation. 
Any sequences that are found to be common to those required by the cloning 

20 system are removed manually from the proposed coding sequence ensuring 
common E. coli codon usage is maintained. E. coli codon usage is assessed by 
reference to software programs such as Graphical Codon Usage Analyser 
(Geneart), and the %GC content and codon usage ratio assessed by reference to 
published codon usage tables (for example GenBank Release 143, 13 September 

25 . 2004). This optimised DNA sequence is then commercially synthesized (for 
example by Entelechon, Geneart or Sigma-Genosys) and is provided in the pCR 4 
vector. 

The alternative method is to use PCR amplification from an existing DNA sequence 
30 with Pst\ and Xbal-stop codon-H/ndlll restriction enzyme sequences incorporated 
into the 5' and 3' PCR primers respectively. The PCR amplification is perfomied as 
; described above.- rThe^BCR product-Js Inaeited into pCR4 vectorand checked by 
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sequencing. Any additional restriction sequences which are not compatible with the 
cloning system are then removed using site directed mutagenesis [for example using 
Quickchange (Stratagene Inc.)]. 

5 Example 19 - Preparation of a LC/A-nociceptin-HN/A fusion protein (nociceptin 
is N-terminal of tlie HN-chain) 

Preparation of linker-nociceptin-spacer insert 

The LC-Hn linker can be designed from first principle, using the existing sequence 

10 infomiation for the linker as the template. For example, the serotype A linker (in this 
case defined as the inter-domain polypeptide region that exists between the 
cysteines of the disulphide bridge between LC and Hn) is 23 amino acids long and 
has the sequence VRGIITSKTKSLDKGYNKALNDL. Within this sequence, it is 
understood that proteolytic activation in nature leads to an Hn domain that has an N- 

15 terminus of the sequence ALNDL. This sequence infomiation is freely available from 
available database sources such as GenBank (accession number P10845) or 
Swissprot (accession locus BXA1_CL0B0). Into this linker a Factor Xa site, 
nociceptin and spacer are incorporated; and using one of a variety of reverse 
translation software tools [for example EditSeq best E. coli reverse translation 

20 (DNASTAR Inc.), or Backtfanslation tool v2.0 (Entelechon)], the DNA'sequence 
encoding the linker-ligand-spacer region is determined. Restriction sites are then' 
incorporated into the DNA sequence and can be arranged as BamHI-Sa/l-llnker- 
protease site-nociceptin-A//7el-spacer-Spel-Psfl-Xjbal-stop codon-H/ndlll (SEQ ID33). 
It is important to ensure the conect reading frame is maintained for the spacer, 

25 nociceptin and restriction sequences and that the Xba\ sequence is not preceded by 
the bases, TC, which would result on DAM methylation. The DNA sequence is 
screened for restriction sequence incorporation, and any additional sequences are 
removed manually from the remaining sequence ensuring common E. coli codon 
usage is maintained. E. coli codon usage is assessed by reference to software 

30 . programs such as Graphical Codon Usage Analyser (Geneart), and the %GC 
content and codon usage ratio assessed by reference to published codon usage 
-v. . ^rtai3\esifmexm0¥^ This optimised ^h^^^fji^' 
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DNA sequence is then commercially synthesized (for example by Entelechon, 
Geneart or Sigma-Genosys) and is provided in the pCR 4 vector. 

-Preparation ofthe-hG/A-noeieeptin-HN/A-fusion 

5 In order to create the LC-linker-nociceptin-spacer-HN construct (SEQ ID39), the pCR 
4 vector encoding the linker (SEQ ID33) is cleaved with SamHI Sail restriction 
enzymes. This cleaved vector then serves as the recipient vector for insertion and 
ligation of the LC/A DNA (SEQ ID27) cleaved with SamHI + Sa/I. The resulting 
plasmid DNA is then cleaved with Psfl + Xba\ restriction enzymes and serves as the 

10 recipient vector for the insertion and ligation of the Hn/A DNA (SEQ ID28) cleaved 
with Psfl + Xbal. The final construct contains the LC-linker-nociceptin-spacer-HN 
ORF (SEQ ID39) for transfer into expression vectors for expression to result in a 
fusion protein of the sequence illustrated in SEQ ID40. 

IS Example 20 - Preparation of a nociceptin-LC/A-HN/A fusion protein (nociceptin 
is N-terminai of the LC-chain) 

The LC/A-Hn/A backbone is constructed as described in Example 19 using the 
synthesised A serotype linker with the addition of a Factor Xa site for activation, 

20 ananged as SamHI-Sa/l-linker-protease site-linker-Psfl-Xbal-stop codon-H/ndlll 
(SEQ ID34). The LC/A-Hn/A backbone and the synthesised N-tenminal presentation 
nociceptin insert (SEQ ID35) are cleaved with SamHI + HindlW restriction enzymes, 
gel purified and ligated together to create a nociceptin-spacer-LC-iinker-HN. The 
ORF (SEQ ID41) is then cut out using restriction enzymes Aval + Xbal for transfer 

25 Into expression vectors for expression to result in a fusion protein of the sequence 
Illustrated In SEQ ID42. 

Example 21 - Preparation of a LC/C-nociceptin-HN/C fusion protein 

30 Following the methods used in Examples 1 and 2, the LC/C (SEQ ID31 ) and Hn/C 
(SEQ ID32) are created and inserted into the C serotype linker arranged as BamHl- 
^. > v r eSs/WlKikefKpeateas© site^TK)ck:eptiBW*s^el-sp8c*6r-Sp6l7Psfl^-j>^i)ai-s codon-H/n^Jil . ■'f^^;:::'u'% 
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(SEQ ID36). The final construct contains the LC-linker-nociceptin-spacer-HN ORF 
(SEQ ID43) for expression as a protein of the sequence illustrated In SEQ ID44. 



Example 22 - Preparation of a LC/C-nociceptin-HN/C fusion protein with a 
5 serotype A activation sequence 



Following the methods used In Examples 1 and 2, the LC/C (SEQ ID31) and Hn/C 
(SEQ ID32) are created and Inserted Into the A serotype linker arranged as BamHl- 
Sa/l-linker-protease site-nociceptin-A//7el-spacer-Spel-Psfl-X6al-stopcodon-H//7dlll 
10 (SEQ ID33). The final construct contains the LC-linker-nociceptin-spacer-HN ORF 
(SEQ ID45) for expression as a protein of the sequence illustrated In SEQ ID46. 



Example 23 - Preparation of a LC/A-met enl(ephaiin-HN/A fusion protein 



15 -Due to the small, five-amino acid, size of the met-enkephalin ligand the LC/A-met 
enkephalin-HN/A fusion is created by site directed mutagenesis [for example using 
Quickchange (Stratagene Inc.)] using the LC/A-noclceptin-HN/A fusion (SEQ ID39) 
as a template. Oligonucleotides are designed encoding the YGGFM met-enkephalin 
peptide, ensuring standard E.coli codon usage Is maintained and no additional 

20 restriction sites are incorporated , flanked by sequences complimentary to the linker 
region of the LC/A-noclceptin-HiM/A fusion (SEQ ID39) either side on the nociceptin 
section. The SDM product Is checked by sequencing and the final constmct 
containing the LC-linker-met enkephalin-spacer-HN ORF (SEQ I D47) for expression 
as a protein of the sequence illustrated in SEQ ID48. 



25 



Example 24 - Preparation of a LC/A-p endorphin-HN/A fusion protein 



Following the methods used in Examples 1 and 2, the LC/A (SEQ ID27) and Hn/A 
(SEQ ID28) are created and inserted into the A serotype p endorphin linker arranged 
30 as SamHI-Sa/l-linker-protease site-p endorphin-AZ/iel-spacer-Spel-Psfl-Xibal-stop 
codon-Mndlll (SEQ ID37). The final construct contains the LC-linker-p endorphin- 
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spacer-Hw ORF (SEQ ID49) for expression as a protein of the sequence illustrated In 
SEQ ID50. 

-Example 25- Preparation of-a LC/A-nociceptin variant-HN/A fusion protein 

5 

Following the methods used in Examples 1 and 2, the LC/A (SEQ ID27) and Hu/A 
(SEQ ID28) are created and inserted into the A serotype nociceptin variant linker 
arranged as BamHI-Sa/l-linl<er-protease site-nociceptin variant-A//7el-spacer-Spel- 
Psfl-XZ)al-stop codon-H/ndlll (SEQ ID38). The final construct contains the LC- 
10 linl<er-nociceptin variant-spacer-HN ORF (SEQ ID51 ) for expression as a protein of 
the sequence illustrated in SEQ ID52. 

Example 26 - Purification method for LC/A-nociceptin-Hn/A fusion protein 

15 --Defrost-falconr1obd-containing-25-rn1-5&-mM-HEPES pH-7:2;-200-m1VI NaCI and 
approximately 1 0 g of E. coli BL21 cell paste. Make the thawed cell paste up to 80 
ml with 50 mM HEPES pH 7.2, 200 mM NaCI and sonicate on ice 30 seconds on, 30 
seconds off for 10 cycles at a power of 22 microns ensuring the sample remains 
cool. Spin the lysed cells at 18 000 rpm, 4*^0 for 30 minutes. Load the supernatant 

20 onto a 0.1 M NiS04 charged Chelating column (20-30 ml column is sufficient) 
equilibrated with 50 mM HEPES pH 7.2, 200 mM NaCI. Using a step gradient of 10 
and 40 mM imidazol, wash away the non-specific bound protein and elute the fusion 
protein with 100 mM imidazol. Dialyse the eluted fusion protein against 5 L of 50 
mM HEPES pH 7.2, 200 mM NaCI at 4°C overnight and measure the CD of the 

25 dialysed fusion protein. Add 1 unit of factor Xa per 100 |jg fusion protein and 
Incubate at 25*'C static overnight. Load onto a 0.1 M NiS04 charged Chelating 
column (20-30 ml column is sufficient) equilibrated with 50 mM HEPES pH 7.2, 200 
mM NaCI. Wash column to baseline with 50 mM HEPES pH 7.2, 200 mM NaCI. 
Using a step gradient of 10 and 40 mM imidazol, wash away the non-specific bound 

30 protein and elute the fusion protein with 1 00 mM imidazol. Dialyse the eluted fusion 
protein against 5 L of 50 mM HEPES pH 7.2, 200 mM NaCI at 4*C overnight and 
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concentrate the fusion to atxjut 2 mg/ml, aliquot sample and freeze at -20°C. Test 
purified protein using OD, BCA, purity analysis and SNAP-25 assessments. 

~Examplef^7-=^~Preparation of^L-C/A-nociceptin4lN^^ proteirr(nociceptm 
5 is N-terminal of the HN-chain) 

The linker-nociceptin-spacer insert is prepared as described in Example 19. 

Preparation of the LC/A-nociceptin-Hr/A fusion 

10 In order to create the LC-linker-nociceptin-spacer-HN construct (SEQ ID39), the pCR 
4 vector encoding the linker (SEQ ID33) is cleaved with SamHi ■•- Sa/I restriction 
enzymes. This cleaved vector then serves as the recipient for insertion and ligation 
of the LC/A DNA (SEQ ID27) also cleaved with SamHI + Sa/I. The resulting plasmid 
DNA is then cleaved with SamHI + HindWl restriction enzymes and the LC/A-linker 

15 ■ fragment inserted Into a similarly cleaved vector containing a unique multiple cloning 
site for SamHI, Sa/I, Psti, and HindWl such as the pMAL vector (NEB). The Hn/A 
DNA (SEQ ID28) is then cleaved with Pstt + HindWl restriction enzymes and inserted 
into the similarly cleaved pMAL-LC/A-linker construct. The final construct contains 
the LC-linker-nociceptin-spacer-HN ORF (SEQ ID39) for expression as a protein of 

20 the sequence illustrated in SEQ ID40. 

Example 28- Preparation of a noclceptin-LC/A-HN/A fusion protein (nociceptin 
is N-terminal of the LC-chain) 

25 In order to create the nociceptin-spacer-LC/A-HN/A construct, an A serotype linker 
with the addition of a Factor Xa site for activation, an-anged as SamHI-Sa/l-llnker- 
protease site-linker-Psfl-Xfcal-stop codon-H//idlll (SEQ ID34) is syntheslsed as 
described in Example 27. The pCR 4 vector encoding the linker is cleaved with 
SamHI + Sa/I restriction enzymes. This cleaved vector then serves as the recipient 

30 . for insertion and ligation of the LC/A DNA (SEQ ID27) also cleaved with SamHI + 
Sa/I. The resulting plasmid DNA is then cleaved with SamHI + HindWl restriction 
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enzymes and the LC/A-linker fragment inserted into a similarly cleaved vector 
containing tlie synthesised N-terminal presentation nociceptin insert (SEQ ID35). 
This construct is then cleaved with Aval + HindWl and inserted into an expression 
— veet0r-suchas^he^Ab1^lasmid(^^IEB^T4^e+^^l/A■BNA-(SEQ-ID28> cleaved 
s with PsU + Hindi 1 1 restriction enzymes and inserted into the similarly cleaved pIVIAL- 
nociceptin-LC/A-linker construct. The final construct contains the nociceptin-spacer- 
LC/A-Hn/A ORF (SEQ ID63) for expression as a protein of the sequence illustrated 
■in SEQ ID64. 

10 Example 29 - Preparation and purification of an LC/A-nociceptin-HN/A fusion 
protein family with variable spacer length 

Using the same strategy as employed in Example 19, a range of DNA linkers were 
prepared that encoded nociceptin and variable spacer content. Using one of a 

15 variety of reverse Iranslation software tools [for example EditSeq bestE. co//reverse 
translation (DNASTAR Inc.), or Backtranslation tool v2.0 (Entelechon)], the DNA 
sequence encoding the linker-llgand-spacer region is detenmined. Restriction sites 
are then incorporated into the DNA sequence and can be an-anged as BamH\-Sal\- 
linker-protease site-nociceptin-Atf?el-spacer-Spel-Psfl-X/>al-stop codon-H/ndlll (SEQ 

20 ID65 to SEQ iD69). It is important to ensure the conrect reading frame is maintained 
for the spacer, nociceptin and restriction sequences and that the Xba\ sequence is 
not preceded by the bases, TC which would result on DAM methylation. The DNA 
sequence is screened for restriction sequence incorporation and any additional 
sequences are removed manually from the remaining sequence ensuring common 

25 E. CO// codon usage is maintained. £. co// codon usage is assessed by reference to 
software programs such as Graphical Codon Usage Analyser (Geneart), and the 
%GC content and codon usage ratio assessed by reference to published codon 
. usage tables (for example GenBank Release 143, 13 September 2004). This 
optimised DNA sequence is then commercially synthesized (for example by 

30 Entelechon, Geneart or Sigma-Genosys) and is provided in the pCR 4 vector. 

„ The spacers that were created included: r «. - 



wo 2006/059105 



PCT/GB2005/004598 



79 



Table 2 



Code 


Protein sequence of the linker 


SEQ ID of the 
-linker-DNA 






GS10 


ALAGGGGSALVLQ 


53 


GS15 


ALAGGGGSGGGGSALVLQ 


54 


GS25 


ALAGGGGSGGGGSGGGGSGGGGSALVLQ 


55 


GS30 


ALAGGGGSGGGGSGGGGSGGGGSGGGGSALVLQ 


56 


HX27 


ALAAEAAAKEAAAKEAAAKAGGGGSALVLQ 


57 



By way of example, In order to create the LC/A-CPN(GS15>-Hn/A fusion constnjct 
5 (SEQ ID70), the pCR4 vector encoding the linker (SEQ ID66) is cleaved with eamHI 
+ Sa/I restriction enzymes. This cleaved vector then serves as the recipient vector 
for insertion and ligation of the LC/A DNA (SEQ ID27) also cleaved with BamHI + 
"Sa/I. The resulting plasmid DNA is then cleaved with BamHI + HindWl restriction 
enzymes and the LC/A-linker fragment inserted into a similariy cleaved vector 
10 containing a unique multiple cloning site for BamHI, Sa/I, Ps% and HmdIII such as 
the pMAL vector (NEB). The Hn/A DNA (SEQ ID28) is then cleaved with Pstt + 
HindiW restriction enzymes and inserted into the similariy cleaved pMAL-LC/A-linker 
constmct. The final construct contains the LC/A-CPN(GS15)-Hn/A ORF (SEQ ID70) 
for expression as a protein of the sequence illustrated in SEQ ID71. 

15 

As a further example, to create the LC/A-CPN(GS25)-Hn/A fusion construct (SEQ 
ID72), the pCR 4 vector encoding the linker (SEQ ID67) is cleaved with fiamHI + 
Sa/I restriction enzymes. This cleaved vector then serves as the recipient vector for 
insertion and ligation of the LC/A DNA (SEQ ID27) cleaved with BamHI + Sa/I. The 
20 resulting plasmid DNA is then cleaved with BamHI + Hind\\\ restriction enzymes and 
the LC/A-linkerfragment inserted into a similariy cleaved vector containing a unique 
multiple cloning site for BamHI, Sa/I, Pst\, and HindWl such as the pMAL vector 
(NEB). The Hn/A DNA (SEQ ID28) is then cleaved with Pst\ + HindWl restriction 
enzymes and inserted into the similarly cleaved pMAL-LC/A-linker construct. The 
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final construct contains the LC/A-CPN(GS25)-Hn/A ORF (SEQ ID72) for expression 
as a protein of tlie sequence illustratecl in SEQ ID73. 

--Varlante-of-the tG/A-GPN-HN/A-fusion eonsisting-of GS1€, ^S30 and-HX27 are 
s similarly created. Using tiie purification methodology described in Example 26, 
fusion protein is purified from E. co// cell paste. Figure 12 illustrates the purified 
product obtained in the case of LC/A-CPN(GS10)-Hn/A, LC/A-CPN(GS15)-Hn/A, 
LC/A-CPN(GS25)-Hn/A, LC/A-CPN(GS30)-Hn/A and LC/A-CPN(HX27)-Hn/A. 

10 Example 30 - Assessment of in vitro efTicacy of an LC/A-nociceptin-HN/A 
fusion 

Fusion protein prepared according to Examples 2 and 9 was assessed in the eDRG 
neuronal cell model. 

Assays for the inhibition of substance P release and cleavage of SNAP-25 have 
been previously reported (Duggan etal., 2002, J. Biol. Chem., 277, 34846-34852). 
Briefly, dorsal root ganglia neurons are harvested from 15-day-old fetal Sprague- 
Dawley rats and dissociated cells plated onto 24-well plates coated with Matrigel at a 

20 density of 1 x 1 0^ cells/well. One day post-plating the cells are treated with 1 0 pM 
cytosine p-D-arabinofuranosidefor48 h. Cells are maintained in Dulbecco's minimal 
essential medium supplemented with 5% heat-inactivated fetal bovine serum, 5 mM 
L-glutamine, 0.6% D-glucose, 2% B27 supplement, and 100 ng/ml 2.5S mouse 
nerve growth factor. Cultures are maintained for 2 weeks at 37°C in 95% air/5% 

25 CO2 before addition of test materials. 

Release of substance P from eDRG is assessed by enzyme-linked immunosorbent 
assay. Briefly, eDRG cells are washed twice with low potassium-balanced salt 
solution (BSS: 5 mM KCI, 137 mM NaCI, 1.2 mM MgCb, 5 mM glucose, 0.44 mM 
30 KH2PO4, 20 mM HEPES, pH 7.4, 2 mM CaCb). Basal samples are obtained by 
incubating each well for 5 min. with 1 ml of low potassium BSS. After removal of this 
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buffer, the cells are stimulated to release by incubation with 1 ml of high potassium 
buffer (BSS as above with modification to include 1 00 mM KCI isotonically balanced 
with NaCI) for 5 min. All samples are removed to tubes on ice prior to assay of 
substance P. Total cell lysates are prepared by addition of 250iil-of 2 M acetic 

5 acld/0.1% trifluoroacetic acid to lyse the cells, centrifugal evaporation, and 
resuspension in 500 pi of assay buffer. Diluted samples are assessed for substance 
P content. Substance P immunoreactivity is measured using Substance P Enzyme 
Immunoassay Kits (Cayman Chemical Company or R&D Systems) according to 
manufacturers' instructions. Substance P is expressed in pg/ml relative to a 

10 standard substance P curve run in parallel. 

SDS-PAGE and Western blot analysis were performed using standard protocols 
(Novex). SNAP-25 proteins were resolved on a 12% Tris/glydne polyacrylamide gel 
(Novex) and subsequently transfened to nitrocellulose membrane. The membranes 

15 were probed with a monoclonal antibody (SMI-81 ) that recognises cleaved and intact 
SNAP-25. Specific binding was visualised using peroxidase-conjugated secondary 
antibodies and a chemiluminescent detection system. Cleavage of SNAP-25 was 
quantified by scanning densitometry (Molecular Dynamics Personal SI, ImageQuant 
data analysis software). Percent SNAP-25 clea\^ge was calculated according to the 

20 fomiula: (Cleaved SNAP-25/(Cleaved+lntact SNAP-25))x1 00. 

Following exposure of eDRG neurons to an LC/A-nociceptin-HN/A fusion (temned 
CPN-A), both inhibition of substance P release and cleavage of SNAP-25 are 
observed (Figure 13). After 24 h exposure to the fusion, 50% of maximal SNAP-25 
25 cleavage is achieved by a fusion concentration of 6.3±2.5 nM. 

The effect of the fusion is also assessed at defined time points following a 16 h 
exposure of eDRG to CPN-A. Figure 14 illustrates the prolonged duration of action 
of the CPN-A fusion protein, with measurable activity still being observed at 28 days 
30 post exposure.. 
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Example 31 - Assessment of in vitro efficacy of an LC/A-nociceptin variant- 
Hn/A fusion 

' Fusion-protein prepared-according to Examples-8 and 9 was assessed in the eDRG 
5 neuronal cell mode using the method described in Example 30, 

Following exposure of eDRG neurons to an LC/A-nociceptin variant-HN/A fusion 
(tenmed CPNv-A), both inhibition of substance P release and cleavage of SNAP-25 
are observed. After 24 h exposure to the fusion, 50% of maximal SNAP-25 cleavage 
10 is achieved by a fusion concentration of 1 .4±0.4 nM (Figure 15). 

The effect of the fusion is also assessed at defined time points following a 16 h 
exposure of eDRG to CPN-A. Figure 16 illustrates the prolonged duration of action 
of the CPN-A fusion protein, with measurable activity still being observed at 24 days 
15 post exposure. 

The binding capability of the CPNv-A fusion protein is also assessed in comparison 
to the CPN-A fusion. Figure 1 7 illustrates the results of a competition experiment to 
determine binding efficacy at the ORH receptor. CPNv-A is demonstrated to 
20 displace [3H]-nociceptin, thereby confirming that access to the receptor is possible 
with the ligand in the central presentation fomnat. 

Example 32 - Preparation of an LC/A-nociceptIn variant-HN/A fusion protein 
that is activated by treatment with Enterol^inase 

25 . 

Following the methods used in Examples 1 and 2, the LC/A (SEQ ID27) and Hn/A 
(SEQ ID28) are created and inserted into the A serotype nociceptin variant linker 
arranged as BamHI-Sa/l-llnker-enterokinase protease site-nociceptin variant-A//?el- 
spacer-Spel-Psfl-Xbal-stop codon-H/ndlll (SEQ ID74). The final construct contains 
30 the LC-linker-nociceptin variant-spacer-HN ORF sequences (SEQ ID75) for 
expression as a protein of the sequence illustrated in SEQ ID76. The fusion protein 
is termed CPNv(Ek)-A. Figure 1 8 illustrates the purification of CPNv(Ek)-A from £ 
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co//following the methods used in Example 26 but using Enterokinasefor activation 
at 0.00064 |jg per 100 [ig effusion protein. 

-Example 33 - Assessment-of in vitro efficacy of an LC/A-nociceptin variant- 
Hn/A fusion that has been activated by treatment with enterokinase 

The CPNv(Ek)-A prepared in Example 32 is obtained in a purified fonn and applied 
to the eDRG cell model to assess cleavage of SNAP-25 (using methodology ftx)m 
Example 30). Figure 1 9 illustrates the cleavage of SNAP-25 following 24 h exposure 
of eDRG to CPNv(Ek)-A. The efficiency of cleavage is observed to be similar to that 
achieved with the Factor Xa-cleaved material, as recorded in Example 31. 

Example 34 - Preparation of an LC/C-nociceptin variant-HN/C fusion protein 
with a Factor Xa activation linker derived from serotype A 

Following the methods used in Example 21, the LC/C (SEQ ID31 ) and Hn/C (SEQ 
ID32) are created and inserted into the A serotype nociceptin variant linker an-anged 
as BamHI-Sa/l-linker-nociceptin variant-A//7el-spacer-Spel-Psfl-Xibal-stop codon- 
H/ndlll (SEQ ID77). The final construct contains the LC-linker-nociceptin variant- 
spacer-HN ORF sequences (SEQ ID78) for expression as a protein of the sequence 
illustrated in SEQ ID79. The fusion protein is temried CPNv-C (act. A). Figure 20 
illustrates the purification of CPNv-C.(act. A) from E. coli following the methods used 
in Example 26. 

Example 35 - Assessment of in vitro efficacy of an LC/C-nociceptin variant- 
Hn/C fusion protein 

Following the methods used in Example 26, the CPNv-C (act. A) prepared in 
Example 34 is obtained in a purified form and applied to the eDRG cell model to 
assess cleavage of SNAP-25 (using methodology from Example 30). After 24 h 
exposure to the fusion, 50% of maximal syntaxin cleavage is achieved by a fusion 
-cpQpeqtration.of 3,J ±M.nM. figi^Fg^l Ulustr3tes,tbefileavage-of ^taxin following 
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24 h exposure of eDRG to CPNv-C (act. A). 

Example 36 - Assessment of in vivo efficacy of an LC/A-nociceptin-HN/A 
-fusion - 

5 

Tlie ability of an LC/A-nodceptin- Hn/A fusion (CPN/A) to iniiibit acute capsaicin- 
induced mechanical allodynia is evaluated following subcutaneous intraplantar 
injection in the rat hind paw. Test animals are evaluated for paw withdrawal 
frequency (PWF%) in response to a 10 g Von Frey filament stimulus series (10 

10 stimuli X 3 trials) prior to recruitment into the study, after subcutaneous treatment 
with CPN/A but before capsaicin, and following capsaicin challenge post-injection of 
CPN/A (average of responses at 15' and 30'). Capsaicin challenge is achieved by 
injection of 10 pL of a 0.3% solution. Sample dilutions are prepared in 0.5% 
BSA/saline. Figure 22 illustrates the reversal of mechanical allodynia that is 

15 achieved by pre-treatment of the animals with a range of concentrations of LC/A- 
nociceptin-HN/A fusion. 

The ability of an LC/A-nociceptin-HN/A fusion (CPN/A) to inhibit streptozotocin 
(STZ)T-induced mechanical (tactile) allodynia in rats is evaluated. STZ-induced 

20 mechanical allodynia in rats is achieved by injection of streptozotocin (i.p. or i.v.) 
which y\e\(is destruction of pancreatic p-cells leading to loss of insulin production, 
with concomitant metabolic stress (hyperglycemia and hyperlipidemia). As such, 
STZ induces Type I diabetes. In addition, STZ treatment leads to progressive 
development of neuropathy, which serves as a model of chronic pain with 

25 hyperalgesia and allodynia that may reflect signs observed in diabetic humans 
(peripheral diabetic neuropathy). 

. Male Sprague-Dawley rats (250-300 g) are treated with 65 mg/kg STZ in citrate 
buffer (I.V.) and blood glucose and lipid are measured weekly to define the 
30 readiness of the model. Paw Withdrawal Threshold (PWT) is measured in response 
to a Von Frey filament stimulus series over a period of time. Allodynia is said to be 
established'When-1he-F¥l?^=6h''two.e^^ by 1 week) 
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10 



measures below 6 g on the scale. At this point, rats are randomized to either a 
saline group (negative efficacy control), gabapentin group (positive efficacy control) 
or a test group (CPN/A). Test materials (20-25 pi) are injected subcutaneously as a 
-single-injection -(exeept-gabapentin) and the -PWT-is- measttred at-1 day-post- 
treatment and periodically thereafter over a 2-week period. Gabapentin (30 mg/kg 
i.p. @ 3 ml/kg injection volume) Is injected daily, 2 hours prior to the start of PWT 
testing. Figure 23 illustrates the reversal of allodynia achieved by pre-treatment of 
the animals with 750 ng of CPN/A. Data were obtained over a 2-week period after a 
single injection of CPN/A 

Example 37 - /^sessment of in vivo efricacy of an LC/A-nociceptin variant- 
Hn/A fusion 



The ability of an LC/A-nociceptin variant-HN/A fusion (CPNv/A) to inhibit capsalcln- 
15 induced mechanical allodynia is evaluated following subcutaneous intraplantar 
injection in the rat hind paw. Test animals are evaluated for paw withdrawal 
frequency (PWF%) in response to a 10 g Von Frey filament stimulus series (10 
stimuli X 3 trials) prior to recruitment into the study (Pre-Treat); after subcutaneous 
Intraplantar treatment with.CPNv/A but before capsaicin (Pre-CAP); and following 
20 capsaicin challenge post-Injection of CPN v/A (average of responses at 1 5' and 30'; 
CAP). Capsaicin challenge is achieved by Injection of 10 pL of a 0.3% solution. 
Sample dilutions are prepared in 0.5% BSA/sallne. 

Figure 24 illustrates the reversal of allodynia that is achieved by pre-treatment of 
25 the animals with a range of concentrations of LC/A-nociceptin variant-HN/A fusion in 
comparison to the reversal achieved with the addition of LC/A-nociceptin-HN/A 
fusion. These data are expressed as a nomnalized paw. vwthdrawal firequency 
differential, in which the difference between the peak response (post-capsaicin) and 
the baseline response (pre-capsaicin) is expressed as a percentage. With this 
30 analysis. It can be seen that CPNv/A is more potent than CPN/A since a lower dose 
of CPNv/A is required to achieve similar analgesic effect to that seen with CPN/A. 
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Example 38 - Preparation of an LC/A-leu enkephalin-HN/A fusion protein 



Due to the small, five-amino acid, size of the leu-enkephalln llgand the LC/A-leu 
-enkephalin-HN/A fusion is created by site directed mutagenesis [for example using 
5 Quickchange (Stratagene Inc.)] using the LC/A-nociceptin-HN/A fusion (SEQ I D39) 
as a template. Oligonucleotides are designed encoding the YGGFL leu-enkephalln 
peptide, ensuring standard E. coli codon usage is maintained and no additional 
restriction sites are incorporated, flanked by sequences complimentary to the linker 
region of the LC/A-nociceptin-HN/A fusion (SEQ ID39) either side on the nociceptin 
10 section. The SDM product is checked by sequencing and the final construct 
containing the LC-linker-leu enkephalin-spacer-HN ORF (SEQ ID80) for expression 
as a protein of the sequence illustrated in SEQ 1081 . The fusion protein is tenned 
CPLE-A. Figure 25 illustrates the purification of CPLE-A from £. co// following the 
methods used In Example 26. 

15 

Example 39 - Expression and purification of an LC/A-beta-endorphin-l-iN/A 
fusion protein 

. FolkSwing the methodsused in Example 26, and with the LC/A-beta-endorphin-HN/A 
20 fusion protein (tenmed CPBE-A) created in Example 24, the CPBE-A Is purified from 
E. coli. Figure 26 illustrates the purified protein as analysed by SDS-PAGE. 

Example 40 - Preparation of an LC/A-nociceptin mutant-HN/A fusion protein 



25 Due to the single amino acid modification necessary to mutate the nociceptin 
sequence at position 1 from a Phe to a Tyr, the LC/A-nociceptin mutant-HN/A fusion 
is created by site directed mutagenesis [for example using Quickchange (Stratagene 
Inc.)] using the LC/A-nociceptin-HN/A fusion (SEQ ID39) as a template. 
Oligonucleotides are designed encoding tyrosine at position 1 of the nociceptin 

30 sequence, ensuring standard E. coli codon usage is maintained and no additional 
restriction sites are incorporated, flanked by sequences complimentary to the linker 
J^g«PO<iftU® '^Q/'f^^nQcige^tin-HN/A fusion (SEQJD39)eit ^^^"-i^.^- 
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section. The SDM product is checked by sequencing and the final constmct 
containing the LC/A-noclceptin mutant-spacer-HN/A fusion ORF (SEQ ID82) for 
expression as a protein of the sequence illustrated in SEQ ID83. The fusion protein 
Hs termed-GP0P-A. Figure-2-7- illustrates the purlfication-of CPOP-A from E. coli 
5 following the methods used in Example 26. 

Example 41 - Preparation and assessment of an LC/A-nociceptin variant 
mutant-HN/A fusion protein 

10 Due to the single amino acid modification necessary to mutate the nociceptin 
sequence at position 1 from a Phe to a Tyr, the LC/A-nociceptIn variant mutant-HN/A 
fusion Is created by site directed mutagenesis [for example using QuIclcchange 
(Stratagene Inc.)] using the LC/A-noclceptin variant-HN/A fusion (SEQ ID51 ) as a 
template. Oligonucleotides are designed encoding tyrosine at position 1 of the 

15 nociceptin sequence, ensuring standard E. co//-codon usage Is maintained and no 
additional restriction sites are incorporated, flanked by sequences complimentary to 
the linker region of the LC/A-nociceptin varlant-HN/A fusion (SEQ ID51) either side 
on the nociceptin section. The SDM product Is checked by sequencing and the final 
constmct containing the LC/A-nociceptin mutant-spacer-HN/A fusion ORF (SEQ 

20 ID84) for expression as a protein of the sequence illustrated In SEQ ID85. The 
fusion protein is ternied CPOPv-A. Figure 28 illustrates the purification of CPOPv-A 
from E. CO// following the methods used in Example 26. 

Using methodology described in Example 30, CPOPv-A is assessed for its ability to 
25 cleave SNAP-25 In the eDRG cell model. Figure 29 illustrates that CPOPv-A is able 
to cleave SNAP-25 in the eDRG model, achieving cleavage of 50% of the maximal 
SNAP-26 after exposure of the cells to approximately 5.9 nM fusion for 24 h. 

Example 42 - Preparation of an IgA protease-nociceptin variant-HN/A fusion 
30 protein 
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sources such as GenBank (accession number P09790). Infonnation regarding the 
structure of the N. Gonorrhoeae IgA protease gene is available in the literature 
(Pohlner et al., Gene structure and extracellular secretion of Neisseria gonorrhoeae 
-IgA-protease, Nature, •1987,-325(6-1G3),-458-62).- -Using Backtranslation tool v2.0 

5 (Entelechon), the DNA sequence encoding the IgA protease modified for E. coli 
expression was detemiined. A 6amHi recognition sequence was incorporated at the 
5' end and a codon encoding a cysteine amino acid and Sail recognition sequence 
were Incorporated at the 3' end of the IgA DNA. The DNA sequence was screened 
using MapDraw, (DNASTAR Inc.) for restriction enzyme cleavage sequences 

10 incorporated during the back translation. Any cleavage sequences that are found to 
be common to those required for cloning were removed manually from the proposed 
coding sequence ensuring common E. coli codon usage Is maintained. E. coli 
codon usage was assessed Graphical Codon Usage Analyser (Geneart), and the 
%GC content and codon usage ratio assessed by reference to published codon 

15 usage tables. This optimised DNA sequence (SEQ ID86) containing the IgA open 
reading frame (ORF) is then commercially synthesized. 



The IgA (SEQ ID86) is inserted into the LC-linker-nociceptin variant-spacer-HN ORF 
(SEQ ID51 ) using BamH\ and Sell restriction enzymes to replace the LC with the IgA 
20 protease DNA. The final construct contains the IgA-llnker-nodceptin variant-spacer- 
HN ORF (SEQ ID87) for expression as a protein of the sequence Illustrated In SEQ 
ID88. 

Example 43 - Preparation and assessment of a nociceptin targeted 
25 endopeptldase fusion protein with a removable histidine purification tag. 

DNA was prepared that encoded a Factor Xa removable his-tag (his6), although it is 
clear that altematlve proteases site such as Enterokinase and aitemative purification 
tags such as longer histidine tags are also possible. Using one of a variety of 
30 reverse translation software tools [for example EditSeq best E. coli reverse 
translation (DNASTAR Inc.), or Backtranslation tool v2.0 (Entelechon)], the DNA 
• . ; ,. ',:- v« "^- sequence :-encodi^^ XarisinMayabje^tdaM^ detemninedr 
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Restriction sites are then incorporated into the DNA sequence and can be arranged 
as Nhe\-\mker-Spe\-Pst\-HN/A-Xba\-LEIEGRSGHHHHHHStop codon-HindlW (SEQ 
ID89). The DNA sequence is screened for restriction sequence incorporated and 
any-additional sequences are-removed manually from the remaining sequence 

s ensuring common E. coli codon usage is maintained. E coli codon usage is 
assessed by reference to software programs such as Graphical Codon Usage 
Analyser (Geneart), and the %GC content and codon usage ratio assessed by 
reference to published codon usage tables (for example GenBank Release 143, 13 
September 2004). This optimised DNA sequence is then commercially synthesized 

10 (for example by Entelechon, Geneart or Sigma-Genosys) and is provided in the pCR 
4 vector. In order to create CPNv-A-FXa-HT (SEQ ID90, removable his-tag 
construct) the pCR 4 vector encoding the removable his-tag is cleaved v\nth Nhe\ and 
HindWl The Nhe\-Hind\\\ fragment is then inserted into the LC/A-CPNv-Hn/A vector 
(SEQ ID51) that has also been cleaved by Nhe\ and HindlW. The final construct 

15 - contains the LC/A-linker-nociceptin variant-spacer-HN-FXa-Histag-H/ncflll ORF 
sequences (SEQ ID90) for expression as a protein of the sequence illustrated in 
SEQ ID91 . Figure 30 illustrates the purification of CPNv-A-FXa-HT from E. coli 
following the methods used in Example 26. 

20 Example 44 - Preparation of a leu-enkephalin targeted endopeptidase fusion 
protein containing a translocation domain derived from diphtlieria toxin 

The DNA sequence is designed by back translation of the amino acid sequence of 
the translocation domain of the diphtheria toxin (obtained from freely available 

25 database sources such as GenBank (accession number 1XDTT) using one of a 
variety of reverse translation software tools [for example EditSeq best E coli reverse 
translation (DNASTAR Inc.), or Backtranslation tool v2.0 (Entelechon)]. Restriction 
sites are then incorporated into the DNA sequence and can be arranged as Nhe\- 
Linker-Spel-Psfl- diphtheria translocation domain-Xbal-stop codon-H/A7c/lll (SEQ 

30 ID92). Psi\/Xba\ recognition sequences are incorporated at the 5' and 3' ends of the 
translocation domain respectively of the sequence maintaining the correct reading 
-/ - ^ ,^ -franaev."i JI^-BNA^seqy,§flgeu»^^€aej@^^ -such" a& MapDraw, 
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DNASTAR Inc.) for restriction enzyme cleavage sequences Incorporated during the 
back translation. Any cleavage sequences that are found to be common to those 
required by the cloning system are removed manually from the proposed coding 
sequence ensunng-eommon-E.- CO// codon usage is maintainedrE eoli co6on usage 

s is assessed by reference to software programs such as Graphical Codon Usage 
Analyser (Geneart). and the %GC content and codon usage ratio assessed by 
reference to published codon usage tables (for example GenBank Release 143, 1 3 
September 2004). This optimised DNA sequence containing the diphtheria 
translocation domain is then commercially synthesized as A//jel-Linker-Spel-Psfl- 

10 diphtheria translocation domain-Xdal-stop codon-H/ncfl 1 1 (for example by Entelechon, 
Geneart or Sigma-Genosys) and is provided in the pCR 4 vector (Invitrogen). The 
pCR 4 vector encoding the diphtheria translocation domain is cleaved with Nhe\ and 
Xbal. The Nhe\ -Xbal fragment Is then inserted into the LC/A-CPLE-Hn/A vector 
(SEQ ID80) that has also been cleaved by Nhel and Xbal. The final construct 

15 - contains the LC/A-leu-enkephalln-spacer-diphtheria translocation domain ORF 
sequences (SEQ ID93) for expression as a protein of the sequence illustrated in 
SEQ ID94. 

Example 45 - Preparation of a nociceptin variant targeted endopeptidase 
20 fusion protein containing a LC domain derived from tetanus toxin. 

The DNA sequence is designed by back translation of the tetanus toxin LC amino 
acid sequence (obtained from freely available database sources such as GenBank 
(accession number X04436) using one of a variety of reverse translation software 

25 tools [for example EditSeq best E. coli reverse translation (DNASTAR Inc.), or 
Backtranslation tool v2.0 (Entelechon)]. BamH\ISal\ recognition sequences are 
incorporated at the 5' and 3' ends respectively of the sequence maintaining the 
con-ect reading frame (SEQ ID95). The DNA sequence is screened (using software 
such as MapDraw, DNASTAR Inc.) for restriction enzyme cleavage sequences 

30 incorporated during the back translation. Any cleavage sequences that are found to 
be common to those required by the cloning system are removed manually from the 
-,proposdd=ie6dira^sgiqoeRC^f^ E 
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CO// codon usage is assessed by reference to software programs such as Graphical 
Codon Usage Analyser (Geneart), and the %GC content and codon usage ratio 
assessed by reference to published codon usage tables (for example GenBank 
Release4437 13 September 2004). This optimised DNA- sequence containing the 
5 tetanus toxin LC open reading frame (ORF) is then commercially synthesized (for 
example by Entelechon, Geneart or Sigma-Genosys) and is provided in the pCR 4 
vector (Invitrogen). The pCR 4 vector encoding the TeNT LC is cleaved with BamHI 
and San. The SamHI - Sa/I fragment is then inserted into the LC/A-CPNv-Hn/A 
vector (SEQ ID51) that has also been cleaved by BamHI and Sa/I. The final 
10 construct contains the TeNT LC-linker-nociceptin variant-spacer-HN ORF sequences 
(SEQ ID96) for expression as a protein of the sequence illustrated in SEQ ID97. 

Example 46 - Preparation of an LC/C-nociceptin variant-HN/C fusion protein 
witii a native serotype C linlcer tliat is susceptible to Factor Xa cleavage 

15 - - .... 

Follo^wing the methods used in Example 21, the LC/C (SEQ ID31) and Hn/C (SEQ 
ID32) are created and inserted into the C serotype nociceptin variant linker an-anged 
as Sa/nHI-Sa/l-linker-nociceptin variant-ZV/iel-spacer-Spel-Psfl-Xbal-stop codon- 
H/ndlll (SEQ ID98). The final construct contains the LC-linker-nociceptin variant- 
20 spacer-HN ORF sequences (SEQ I D99) for expression as a protein of the sequence 
illustrated in SEQ ID100. The fusion protein is termed CPNv-C (act. C). 
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Claims: 

1 . A non-cytotoxic protein conjugate for inhibition or reduction of exocytic fusion 
•ln-€hnocieeptive sensory affefenfceli-, -eomprisingr 

(I) Targeting Moiety (TM), 

wherein said Tl\^ is an agonist of a receptor present on said 
nociceptive sensory afferent cell, and wherein said receptor 
undergoes endocytosis to be incorporated into an endosome 
within the nociceptive sensory afferent cell; 

(jj) a non-(ytotoxic protease or a fragment thereof, 

wherein -the- protease or protease fragment is capable of 
cleaving a protein of the exocytic fusion apparatus of said 
nociceptive sensory afferent cell; and 

(ill) a Translocation Domain, 

wherein the Translocation Domain translocates the protease or 
protease fragment from within the endosome, across the 
endosomal membrane, and into the cytosol of the nociceptive 
sensory afferent cell. 

2. The non-cytotoxic conjugate of Claim 1 , wherein the receptor Is an ORLi 
receptor. 

3. The non-cytotoxic conjugate of Claim 1 or 2, wherein the TM has at least 
70% or at least 80 % homology to SEQ ID No. 2 or a fragment thereof. 
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4. The non-cytotoxic conjugate of Claim 1 or 2, wherein the TM has at least 
90% homology to SEQ ID No. 2 or a fragment thereof. 

-5r — -The non-cytotoxic-conjugate of Glaim 1 or Claim 2; wherein the TM has at 
5 least 95% homology to SEQ ID No. 2 or a fragment thereof. 

6. The non-cytotoxic conjugate of Claim 1 or 2, wherein the TM is SEQ I D No. 2 
or a fragment thereof. 

10 7, The non-cytotoxic conjugate of Claim 1 or 2, wherein the TM is nociceptin. 

8. The non-cytotoxic conjugate of Claim 1 or 2, wherein the TM is selected from 
the group consisting of SEQ ID Nos. 4, 6, 8, 10, 12, 14. 

15 9. The non-cytotoxic conjugate of any preceding claim, wherein the non- 
cytotoxic protease is a bacterial protein, or a fragment thereof, capable of 
cleaving a protein of the exocytic fusion apparatus of the nociceptive sensory 
afferent cell. 

20 10. The non-cytotoxic conjugate of Claim 9, wherein the non-cytotoxic protease is 
seilected from a clostridial neurotoxin, or an IgA protease. 

11. The non-cytotoxic conjugate of any preceding claim, wherein the 
Translocation Domain is derived from a clostridial source. 

25 

1 2. The non-cytotoxic conjugate of Claim 1 1 , wherein the Translocation Domain 
Is a botulinum Hn domain. 

1 3. The non-cytotoxic conjugate of any preceding claim, wherein the nociceptive 
30 sensory afferent cell is a primary nociceptive sensory afferent cell. 
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14. The non-cytotoxic conjugate according to Claim 1, wherein said conjugate 
comprises an amino acid sequence selected from the group consisting of 
SEQ ID NOs 14, 16. 18, 20, 22, 24, and 26. 

5 15. A non-cytotoxIc conjugate for Inhibition or reduction of exocytotic fusion In a 
nociceptive sensory afferent cell, comprising: 

(i) a Targeting Moiety (TIVI), 

10 wherein said TM is an agonist of a receptor present on said 

nociceptive sensory afferent cell, and wherein said receptor 
undergoes endoc^osls to be Incorporated Into an endosome 
within the nociceptive sensory afferent cell; 

IS (li) a DNA sequence encoding a non-cytotoxIc protease or a 

fragment thereof, 

wherein the DNA sequence is expressible in the nociceptive 
sensory afferent cell and when so expressed provides a 
20 protease or protease fragment capable of cleaving a protein of 

the exocytic fusion apparatus of said nociceptive sensory 
afferent cell; and 



25 



30 



(ill) a Translocation Domain, 

wherein the Translocation Domain translocates the DNA 
sequence encoding the protease or protease fragment from 
within the endosome, across the endosomai membrane, and 
Into the nociceptive sensory afferent cell. 

16. The non-cytotoxic conjugate of Claim 15, wherein the receptor is. an ORLi 
receptor. . , 



wo 2006/059105 



PCT/GB2005/004598 



95 



17. The non-cytotoxic conjugate of Claim 15 or 16, wherein the TIVl has at least 
70% or at least 80% homology to SEQ ID No. 2 or a fragment thereof. 

5 18. The non-cytotoxic conjugate of Claim 1 5 or 1 6, wherein the TM has at least 
90% homology to SEQ ID No. 2 or a fragment thereof. 

1 9. The non-cytotoxic conjugate of Claim 1 5 or Claim 1 6, wherein the TM has at 
least 95% homology to SEQ ID No. 2 or a fragment thereof. 

10 

20. The non-cytotoxic conjugate of Claim 1 5 or Claim 1 6, wherein the TM is SEQ 
ID No. 2 or a fragment thereof. 

21. The non-cytotoxic conjugate of Claim 15 or Claim 16, wherein the TM Is 
IS nociceptin. 

22. The non-cytotoxic conjugate of Claim 15 or Claim 16, wherein the TM is 
selected from the group consisting of SEQ ID Nos. 4, 6, 8, 10, 12, 14. 

20 23. The non-cytotoxic conjugate of Claim 15 or Claim 16, wherein the non- 
cytotoxic protease is a bacterial protein, or a fragment thereof, capable of 
cleaving a protein of the exoc^ic fusion apparatus of the nociceptive sensory 
afferent cell. 

25 24. The non-cytotoxic conjugate of Claim 23, wherein the non-cytotoxic protease 
is selected from a clostridial neurotoxin, or an IgA protease. 

25. The non-cytotoxic cx)njugate of Claim 15 or Claim 16, wherein the 
Translocation Domain Is derived from a clostridial source. 

30 

26. The non-cytotoxic conjugate of Claim 25, wherein the Translocation Domain 
.r is a botulinum Hn domain. 
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27. The non-cytotoxic conjugate of Claim 15 or 16. wherein the nociceptive 
sensory afferent cell is a primary nociceptive sensory afferent cell. 

28. The non-cytotoxic conjugate of any of Claihis 1 to 14, wherein the TM, the 
Translocation Domain and the protease or fragment thereof are covalently 
linked. 

29. The non-cytotoxic conjugate of any of Claims 1 5 to 27. wherein the TM, the 
Translocation Domain and the DNA sequence encoding the protease or 
fragment thereof, are covalently linked. 

30. A phamnaceutlcal composition, comprising a conjugate according to any of 
Claims 1 to 29 and a pharmaceutically acceptable earner. 

31 . A DNA construct encoding the conjugate of any of Claims 1 to 1 4. 

32. A DNA construct according to Claim 31 , wherein the construct comprises a 
DNA sequence selected from SEQ ID NOs 13, 15, 17. 19, 21, 23, and 25. 

33. A method of preparing the conjugate of any of Claims 1 to 14, comprising 
expressing the DNA construct of Claim 30 in a host cell. 

34. A method for treating pain, comprising administering to a patient a conjugate 
according to any of Claim 1-29 or a composition according to Claim 30. 

35. Use of a conjugate according to any of Claims 1-29 or a composition 
according to Claim 30, for the manufacture of a medicament for treating pain. 
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Figure 1 
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Figure 2 
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Figure 3 
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Figure 5 
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Figure 6 
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Figure 7 
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Figure 8 



Competition Assay : Nociceptin-LH,yA Fusions 
vs 1nM f H]-Nociceptin on eDRGs (4°C) 
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Figure 9 
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Figure 10 
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Figure 11 



Competition Assay: CPN fusions vs InM [3H] - Nociceptin 
on eDRGs for 1 hour at 4°C 
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Figure 13 



CPN-A on eDRG for 1 Day 
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Figure 14 



Duration of action following eDRG exposure for 1 Day 
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Figure 15 
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Figure 16 
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Figure 17 
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Figure 18 
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Figure 19 
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Figure 20 
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Figure 21 
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Figure 22 
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Figure 23 
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Figure 24 
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Figure 25 
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Figure 26 
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Figure 27 
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Figure 28 
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Figure 29 
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Figure 30 
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Figure 31 
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Figure 32 




Concentration (nM) 
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SEQUENCE LISTING 

<110> Health Protection Agency 
Allergan, Inc. 
Chaddock , J ohn 
Foster, Keith 
Penn, Charles 
Aoki, K Roger 
Francis, Joseph 
Steward, Lance 

<120> Re-targeted toxin conjugates 

<130> P27211W0/MRM 

<150> GB0426394.3 
<151> 2004-12-01 

<160> 100 

<170> Patentin version 3.3 

<210> 1 

<211> 51 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Synthetic 

<400> 1 

tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaat tagctaacca g 51 



<210> 2 
<211> 17 
<:212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 2 

Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Leu Ala Asn 
15 10 15 



Gin 



<210> 3 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 3 
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tttggcggtt tcacgggcgc acgcaaatca gcg 33 



<210> 4 

<211> 11 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 4 

Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala 
1 5 10 



<210> 


5 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


5 



tttggcggtt tcacgggcgc acgcaaatat gcg- 33 



<210> 6 
<211> 11 
<212> PRT 

<213> Artificial Setjuence 

<220> 

<223> Synthetic 
<400> 6 

Phe Gly Gly Phe Thr Gly Ala Arg Lys Tyr Ala 
15 10 



<210> 


.7 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


7 



tttggcggtt tcacgggcgc acgcaaatca tat 33 



<210> 8 

<211> 11 

<212> PRT 

<213> Artificial Sequence 



<220> 
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<223> Synthetic 
<400> 8 

Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Tyr 
15 10 



<210> 


9 


<211> 


51 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


9 



tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaat atgctaacca g 51 



<210> 10 
<211> 17 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 10 

Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Tyr Ala Asn 
15 10 15 



Gin 



<210> 11 

<211> 39 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 11 

tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaa 39 



<210> 12 

<211> 13 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 12 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys 



wo 2006/059105 



4 



PCT/GB2005/004598 



1 


5 


<210> 


13 


<211> 


51 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


13 



10 



tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaac gcaaaaacca g 51 



<210> 14 

<211> 17 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 14 

Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Arg Lys Asn 
1 5 10 15 



Gin 



<210> 15 
<211> 2736 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 15 

ctcgggattg agggtcgttt tggcggtttc acgggcgcac gcaaatcagc gcgtaaatta 60 
gctaaccaga ctagtggcgg tgggggtagt ggcggtggcg gttcgggcgg gggtgggagc 120 
cctaggggat ccatggagtt cgttaacaaa cagttcaact ataaagaccc agttaacggt 180 
gttgacattg cttacatcaa aatcccgaac gctggccaga tgcagccggt aaaggcattc 240 
aaaatccaca acaaaatctg ggttatcccg gaacgtgata cctttactaa cccggaagaa 300 
ggtgacctga acccgccacc ggaagcgaaa caggtgccgg tatcttacta tgactccacc 360 
tacctgtcta ccgataacga aaaggacaac tacctgaaag gtgttactaa actgttcgag 420 
cgtatttact ccaccgacct gggccgtatg ctgctgacta gcatcgttcg cggtatcccg 480 
ttctggggcg gttctaccat cgataccgaa ctgaaagtaa tcgacactaa ctgcatcaac 540 
gttattcagc cggacggttc ctatcgttcc gaagaactga acctggtgat catcggcccg 600 
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tctgctgata tcatccagtt cgagtgtaag agctttggtc acgaagttct gaacctcacc 660 

cgtaacggct acggttccac tcagtacatc cgtttctctc cggacttcac cttcggtttt 720 

gaagaatccc tggaagtaga cacgaaccca ctgctgggcg ctggtaaatt cgcaactgat 780 

cctgcggtta ccctggctca cgaactgatt catgcaggcc accgcctgta cggtatcgcc 840 

atcaatccga accgtgtctt caaagttaac accaacgcgt attacgagat gtccggtctg 900 

gaagttagct tcgaagaact gcgtactttt ggcggtcacg acgctaaatt catcgactct 960 

ctgcaagaaa acgagttccg tctgtactac tataacaagt tcaaagatat cgcatccacc 1020 

ctgaacaaag cgaaatccat cgtgggtacc actgcttctc tccagtacat gaagaacgtt 1080 

tttaaagaaa aatacctgct cagcgaagac acctccggca aattctctgt agacaagttg 1140 

aaattcgata aactttacaa aatgctgact gaaatttaca ccgaagacaa cttcgttaag 1200 

ttctttaaag ttctgaaccg caaaacctat ctgaacttcg acaaggcagt attcaaaatc 1260 

aacatcgtgc cgaaagttaa ctacactatc tacgatggtt tcaacctgcg taacaccaac 1320 

ctggctgcta attttaacgg ccagaacacg gaaatcaaca acatgaactt cacaaaactg 1380 

aaaaacttca ctggtctgtt cgagttttac aagctgctgt gcgtcgacgg • catcattacc 1440 

tccaaaacta aatctctgat agaaggtaga aacaaagcgc tgaacgacct ctgtatcaag 1500 

gttaacaact gggatttatt cttcagcccg agtgaagaca acttcaccaa cgacctgaac 1560 

aaaggtgaag aaatcacctc agatactaac atcgaagcag ccgaagaaaa catctcgctg 1620 

gacctgatcc agcagtacta cctgaccttt aatttcgaca acgagccgga aaacatttct 1680 

atcgaaaacc tgagctctga tatcatcggc cagctggaac tgatgccgaa catcgaacgt 1740 

ttcccaaacg gtaaaaagta cgagctggac aaatatacca tgttccacta cctgcgcgcg 1800 

caggaatttg aacacggcaa atcccgtatc gcactgacta actccgttaa cgaagctctg 1860 

ctcaacccgt cccgtgtata caccttcttc tctagcgact acgtgaaaaa ggtcaacaaa 1920 

gcgactgaag ctgcaatgtt cttgggttgg gttgaacagc ttgtttatga" ttttaccgac 1980 

gagacgtccg aagtatctac taccgacaaa attgcggata tcactatcat catcccgtac 2040 

atcggtccgg ctctgaacat tggcaacatg ctgtacaaag acgacttcgt tggcgcactg 2100 

atcttctccg gtgcggtgat cctgctggag ttcatcccgg aaatcgccat cccggtactg 2160 

ggcacctttg ctctggtttc ttacattgca aacaaggttc tgactgtaca aaccatcgac ' 2220 

aacgcgctga gcaaacgtaa cgaaaaatgg gatgaagttt acaaatatat cgtgaccaac 2280 

tggctggcta aggttaatac tcagatcgac ctcatccgca aaaaaatgaa agaagcactg 2340 

gaaaaccagg cggaagctac caaggcaatc attaactacc agtacaacca gtacaccgag 2400 
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gaagaaaaaa acaacatcaa cttcaacatc gacgatctgt cctctaaact gaacgaatcc 2460 

atcaacaaag ctatgatcaa catcaacaag ttcctgaacc agtgctctgt aagctatctg 2520 

atgaactcca tgatcccgta cggtgttaaa cgtctggagg acttcgatgc gtctctgaaa 2580 

gacgccctgc tgaaatacat ttacgacaac cgtggcactc tgatcggtca ggttgatcgt 2640 

ctgaaggaca aagtgaacaa taccttatcg accgacatcc cttttcagct cagtaaatat 2700 

gtcgataacc aacgcctttt gtccactcta gactag 2736 



<210> 16 
<211> 911 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Synthetic 
<400> 16 

Leu Gly He Glu Gly Arg Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser 
15 10 15 



Ala Arg Lys Leu Ala Asn Gin Thr Ser Gly Gly Gly Gly Ser Gly Gly 
20 25 30 



Gly Gly Ser Gly Gly Gly Gly Ser Pro Arg Gly Ser Met Glu Phe Val 
35 40 45 



Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val Asn Gly Val Asp He Ala 
50 55 60 



Tyr He Lys He Pro Asn Ala Gly Gin Met Gin Pro Val Lys Ala Phe 
65 70 75 80 



Lys He His Asn Lys He Trp Val He Pro Glu Arg Asp Thr Phe Thr 
85 90 95 



Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro Pro Glu Ala Lys Gin Val 
100 105 110 ■ 



Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu Ser Thr Asp Asn Glu Lys 
115 120 125 



Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu Phe Glu Arg He Tyr Ser 
130 135 140 



.Thr Asj>. Leu..Gly Arg..Mgt^j.L^^ ^TliT; Ser. He Val Arg Gly-.He Pro 
145 ' - 150 155 ' 160 



wo 2006/059105 



7 



PCT/GB2005/004598 



Phe Trp Gly Gly Ser Thr He Asp Thr Glu Leu Lys Val He Asp Thr 
165 170 175 



Asn Cys He Asn Val He Gin Pro Asp Gly Ser Tyr Arg Ser Glu Glu 
180 185 190 



Leu Asn Leu Val He He Gly Pro Ser Ala Asp He He Gin Phe Glu 
195 200 205 



Cys Lys Ser Phe Gly His Glu Val Leu Asn Leu Thr Arg Asn Gly Tyr 
210 215 220 



Gly Ser Thr Gin Tyr He Arg Phe Ser Pro Asp Phe Thr Phe Gly Phe 
225 230 235 240 



Glu Glu Ser Leu Glu Val Asp Thr Asn Pro Leu Leu Gly Ala Gly Lys 
245 250 255 



Phe Ala Thr Asp Pro Ala Val Thr Leu Ala His Glu Leu He His Ala 
260 265 270 



Gly His Arg Leu Tyr Gly He Ala He Asn Pro Asn Arg Val Phe Lys 
275 280 285 



Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser Gly Leu Glu Val Ser Phe 
290 295 300 



Glu Glu Leu Arg Thr Phe Gly Gly His Asp Ala Lys Phe He Asp Ser 
305 310 315 320 



Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr Tyr Asn Lys Phe Lys Asp 
325 330 335 



He Ala Ser Thr Leu Asn Lys Ala Lys Ser He Val Gly Thr Thr Ala 
340 345 350 



Ser Leu Gin Tyr Met Lys Asn Val Phe Lys Glu Lys Tyr Leu Leu Ser 
355 360 ' 365 



Glu Asp Thr Ser Gly Lys Phe Ser Val Asp Lys Leu Lys Phe Asp Lys 
370 375 380 



Leu Tyr Lys Met Leu Thr Glu He Tyr Thr Glu Asp Asn Phe Val Lys 
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Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr Leu Asn Phe Asp Lys Ala 
405 410 415 



Val Phe Lys He Asn He Val Pro Lys Val Asn Tyr Thr He Tyr Asp 
420 425 430 



Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala Ala Asn Phe Asn Gly Gin 
435 440 445 



Asn Thr Glu He Asn Asn Met Asn Phe Thr Lys Leu Lys Asn Phe Thr 
450 455 460 



Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys Val Asp Gly He He Thr 
465 470 475 480 



Ser Lys Thr Lys Ser Leu He Glu Gly Arg Asn Lys Ala Leu Asn Asp 
485 490 495 



Leu Cys He Lys Val Asn Asn Trp Asp Leu Phe Phe Ser Pro Ser Glu 
500 505 510 



Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu He Thr Ser Asp 
515 520 525 



Thr Asn He Glu Ala Ala Glu Glu Asn He Ser Leu Asp Leu He Gin 
530 535 540 



Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro Glu Asn He Ser 
545 550 555 560 



He Glu Asn Leu Ser Ser Asp He He Gly Gin Leu Glu Leu Met Pro 
565 570 575 



Asn He Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu Leu Asp Lys Tyr 
580 585 590 



Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu His Gly Lys Ser 
595 600 605 



Arg He Ala Leu Thr Asn Ser Val Asn Glu Ala Leu Leu Asn Pro Ser 
610 615 620 



Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys Lys Val Asn Lys 
625 630 635 640 
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Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu Gin Leu Val Tyr 
645 650 655 



Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr Asp Lys He Ala 
660 665 670 



Asp He Thr He He lie Pro Tyr He Gly Pro Ala Leu Asn He Gly 
675 , 680 685 



Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu He Phe Ser Gly 
690 695 700 



Ala Val He Leu Leu Glu Phe He Pro Glu He Ala He Pro Val Leu 
705 710 715 720 



Gly Thr Phe Ala Leu Val Ser Tyr He Ala Asn Lys Val Leu Thr Val 
725 730 735 



Gin Thr He Asp Asn Ala Leu Ser Lys Arg Asn Glu Lys Trp Asp Glu 
740 745 750 



Val Tyr Lys Tyr He Val Thr Asn Trp Leu Ala Lys Val Asn Thr Gin 
755 760 765 



He Asp Leu He Arg Lys Lys Met Lys Glu Ala Leu Glu Asn Gin Ala 
770 775 780 



Glu Ala Thr Lys Ala He He Asn Tyr Gin Tyr Asn Gin Tyr Thr Glu 
785 790 795 800 



Glu Glu Lys Asn Asn He Asn Phe Asn He Asp Asp Leu Ser Ser Lys 
805 810 815 



Leu Asn Glu Ser He Asn Lys Ala Met He Asn He Asn Lys Phe Leu 
820 825 830 



Asn. Gin Cys Ser Val Ser Tyr Leu Met Asn Ser Met He Pro Tyr Gly 
835 840 845 



Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys Asp Ala Leu Leu 
850 855 860 



Lys Tyr He Tyr Asp Asn Arg Gly Thr Leu He Gly Gin Val Asp Arg 
865 870 875 880 

Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp He Pro Phe Gin 
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885 890 895 



Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu Leu Ser Thr Leu Asp 
900 905 910 

<210> 17 
<211> 2700 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 17 



tttggcggtt 


tcacgggcgc 


acgcaaatca tatactagtg 


gcggtggggg 


tagtggcggt 


60 


ggcggttcgg 


gcgggggtgg 


gagccctagg ggatccatgg 


agttcgttaa 


caaacagttc 


120 


aactataaag 


acccagttaa 


cggtgttgac attgcttaca 


tcaaaatccc 


gaacgctggc 


180 


cagatgcagc 


cggtaaaggc 


attcaaaatc cacaacaaaa 


tctgggttat 


cccggaacgt 


240 


gataccttta 


ctaacccgga 


agaaggtgac ctgaacccgc 


caccggaagc 


gaaacaggtg 


300 


ccggtatctt 


actatgactc 


cacctacctg tctaccgata 


acgaaaagga 


caactacctg 


360 


aaaggtgtta 


ctaaactgtt 


cgagcgtatt tactccaccg 


acctgggccg 


tatgctgctg 


420 


actagcatcg 


ttcgcggtat 


cccgttctgg ggcggttcta 


ccatcgatac 


cgaactgaaa 


480 


gtaatcgaca 


ctaactgcat 


caa'cgttatt cagccggacg 


gttcctatcg 


ttccgaagaa 


540 


ctgaacctgg 


tgatcatcgg 


cccgtctgct gatatcatcc 


agttcgagtg 


taagagcttt 


600 


ggtcacgaag 


ttctgaacct 


cacccgtaac ggctacggtt 


ccactcagta 


catccgtttc 


660 


tctccggact 


tcaccttcgg 


ttttgaagaa tccctggaag 


tagacacgaa 


cccactgctg 


720 


ggcgctggta 


aattcgcaac 


tgatcctgcg gttaccctgg 


ctcacgaact 


gattcatgca 


780 


ggccaccgcc 


tgtacggtat 


cgccatcaat ccgaaccgtg 


tcttcaaagt 


taacaccaac 


840 


gcgtattacg 


agatgtccgg 


tctggaagtt agcttcgaag 


aactgcgtac 


ttttggcggt 


900 


cacgacgcta 


aattcatcga 


ctctctgcaa gaaaacgagt 


tccgtctgta 


ctactataac 


960 


aagttcaaag 


atatcgcatc 


caccctgaac aaagcgaaat 


ccatcgtggg 


taccactgct 


1020 


tctctccagt 


acatgaagaa 


cgtttttaaa gaaaaatacc 


tgctcagcga 


agacacctcc 


1080 


ggcaaattct 


ctgtagacaa 


gttgaaattc gataaacttt 


acaaaatgct 


gactgaaatt 


1140 


tacaccgaag 


acaacttcgt 


taagttcttt aaagttctga 


accgcaaaac 


ctatctgaac 


1200 


ttcgacaagg 


cagtattcaa 


aatcaacatc gtgccgaaag 


ttaactacac 


tatctacgat 


1260 


ggtttcaacc 


tgcgtaacac 


caacctggct gctaatttta 


acggccagaa 


cacggaaatc 


1320 


aacaacatga 


acttcacaaa 


actgaaaaac ttcactggtc 


tgttcgagtt 


ttacaagctg 


1380 
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ctgtgcgtcg acggcatcat tacctccaaa actaaatctc tgatagaagg tagaaacaaa 1440 

gcgctgaacg acctctgtat caaggttaac aactgggatt tattcttcag cccgagtgaa 1500 

gacaacttca ccaacgacct gaacaaaggt gaagaaatca cctcagatac taacatcgaa 1560 

gcagccgaag aaaacatctc gctggacctg atccagcagt actacctgac ctttaatttc 1620 

gacaacgagc cggaaaacat ttctatcgaa aacctgagct ctgatatcat cggccagctg 1680 

gaactgatgc cgaacatcga acgtttccca aacggtaa^a agtacgagct ggacaaatat 1740 

accatgttcc actacctgcg cgcgcaggaa tttgaacacg gcaaatcccg tatcgcactg 1800 

actaactccg ttaacgaagc tctgctcaac ccgtcccgtg tatacacctt cttctctagc 1860 

gactacgtga aaaaggtcaa caaagcgact gaagctgcaa tgttcttggg ttgggttgaa 1920 

cagcttgttt atgattttac cgacgagacg tccgaagtat ctactaccga caaaattgcg 1980 

gatatcacta tcatcatccc gtacatcggt ccggctctga acattggcaa catgctgtac 2040 

aaagacgact tcgttggcgc actgatcttc tccggtgcgg tgatcctgct ggagttcatc 2100 

ccggaaatcg ccatcccggt actgggcacc tttgctctgg tttcttacat tgcaaacaag 2160 

gttctgactg tao^aaccat cgacaacgcg ctgagcaaac gtaacgaaaa atgggatgaa 2220 

gtttacaaat atatcgtgac caactggctg gctaaggtta atactcagat cgacctcatc 2280 

cgcaaaaaaa tgaaagaagc actggaaaac caggcggaag ctaccaaggc aatcattaac 2340 

taccagtaca accagtacac cgaggaagaa aaaaacaaca tcaacttcaa catcgacgat 2400 

ctgtcctcta aactgaacga atccatcaac aaagctatga tcaacatcaa caagttcctg 2460 

aaccagtgct ctgtaagcta tctgatgaac tccatgatcc cgtacggtgt taaacgtctg 2520 

gaggacttcg atgcgtctct gaaagacgcc ctgctgaaat acatttacga caaccgtggc 2580 

actctgatcg gtcaggttga tcgtctgaag gacaaagtga acaatacctt atcgaccgac 2640 

atcccttttc agctcagtaa atatgtcgat aaccaacgcc ttttgtccac tctagactag 2700 

<210> 18 
<211> 899 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 18 

Pile Gly Gly Phe Thr Gly Ala Arg Lys Ser Tyr Thr -Ser Gly Gly Gly 
1 5 10 .15 

Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Pro Arg Gly Ser 
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20 25 30 



Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val Asn Gly 
35 40 45 



Val Asp He Ala Tyr He Lys He Pro Asn Ala Gly Gin Met Gin Pro 
50 55 60 



Val Lys Ala Phe Lys He His Asn Lys He Trp Val He Pro Glu Arg 
65 70 75 80 



Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro Pro Glu 
85 90 95 



Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu Ser Thr 
100 105 110 



Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu Phe Glu 
115 120 125 



Arg He Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser He Val 
130 135 140 



Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu Leu Lys 
145 150 155 160 



Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly Ser Tyr 
165 170 175 



Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala Asp He 
180 185 190 



He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn Leu Thr 
195 200 205 



Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro Asp Phe 
210 215 220 



Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro Leu Leu 
225 .230 235 240 



Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala His Glu 
245 250 255 



Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn Pro Asn 
260 265 270 
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Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser Gly Leu 
275 280 285 



Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp Ala Lys 
290 295 300 



Phe lie Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr Tyr Asn 
305 310 315 320 



Lys Phe Lys Asp lie Ala Ser Thr Leu Asn Lys Ala Lys Ser lie Val 
325 330 335 



Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys Glu Lys 
340 345 350 



Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp Lys Leu 
355 360 365 



Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu lie Tyr Thr Glu Asp 
370 375 380 



Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr Leu Asn 
385 390 395 400 



Phe Asp Lys Ala Val Phe Lys lie Asn lie Val Pro Lys Val Asn Tyr 
405 410. 415 



Thr lie Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala Ala Asn 
420 425 430 



Phe Asn Gly Gin TVsn Thr Glu lie Asn Asn Met Asn Phe Thr Lys Leu 
435 440 445 



Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys Val Asp 
450 455 460 



Gly lie lie Thr Ser Lys Thr Lys Ser Leu lie Glu Gly Arg Asn Lys 
465 470 475 480 



Ala Leu Asn Asp Leu Cys lie Lys Val Asn Asn Trp Asp Leu Phe Phe 
485 490 495 



Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu 

:y • ■• .'500 - '-^^ ' ' 50F- 510 
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He Thr Ser Asp Thr Asn He Glu Ala Ala Glu Glu Asn He Ser Leu 
515 520 525 



Asp Leu He Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro 
530 535 540 



Glu Asn He Ser He Glu Asn Leu Ser Ser Asp He He Gly Gin Leu 
545 550 555 560 



Glu Leu Met Pro Asn He Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu 
565 570 575 



Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu 
580 585 590 



His Gly Lys Ser Arg He Ala Leu Thr Asn Ser Val Asn Glu Ala Leu 
595 600 605 



Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys 
610 615 620 



Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu 
625 630 635 640 



Gin Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr 
645 650 655 



Asp Lys He Ala Asp He Thr He He He Pro Tyr He Gly Pro Ala 
660 665 670 



Leu Asn He Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu 
675 680 685 



He Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro Glu He Ala 
690 695 700 



He Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He Ala Asn Lys 
705 710 715 720 



Val Leu Thr Val Gin Thr He Asp Asn Ala Leu Ser Lys Arg Asn Glu 
725 730 735 



Lys Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp Leu Ala Lys 
740 745 750 
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Val Asn Thr Gin He Asp Leu He Arg Lys Lys Met Lys Glu Ala Leu 
755 760 765 



Glu Asn Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr Gin Tyr Asn 
770 775 780 



Gin Tyr Thr Glu Glu Glu Lys Asn Asn He Asn Phe Asn He Asp Asp 
785 790 795 800 



Leu Ser Ser Lys Leu Asn Glu Ser He Asn Lys Ala Met He Asn He 
805 810 815 



Asn Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met Asn Ser Met 
820 825 830 



He Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys 
835 840 845 



Asp Ala Leu Leu Lys Tyr He Tyr Asp Asn Arg Gly Thr Leu He Gly 
850 855 860 



Gin Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp 
865 870 875 880 



He Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu Leu Ser 
885 890 895 



Thr Leu Asp 



<210> 19 
<211> 2706 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Synthetic 
<400> 19 

tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaaa ctagtggcgg tgggggtagt 60 
ggcggtggcg gttcgggcgg gggtgggagc cctaggggat ccatggagtt cgttaacaaa 120 
cagttcaact ataaagaccc agttaacggt gttgacattg cttacatcaa aatcccgaac 180 
gctggccaga tgcagccggt aaaggcattc aaaatccaca acaaaatctg ggttatcccg 240 
gaacgtgata cctttactaa cccggaagaa ggtgacctga acccgccacc ggaagcgaaa 300 
caggtgc/:gg tatcttacta tgactccacc tacctgtcta ccgataacga aaaggacaac 360 
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tacctgaaag gtgttactaa actgttcgag cgtatttact ccaccgacct gggccgtatg 420 

ctgctgacta gcatcgttcg cggtatcccg ttctggggcg gttctaccat cgataccgaa 480 

ctgaaagtaa tcgacactaa ctgcatcaac gttattcagc cggacggttc ctatcgttcc 540 

gaagaactga acctggtgat catcggcccg tctgctgata tcatccagtt cgagtgtaag 600 

agctttggtc acgaagttct gaacctcacc cgtaacggct acggttccac tcagtacatc 660 

cgtttctctc cggacttcac cttcggtttt gaagaatccc tggaagtaga cacgaaccca 720 

ctgctgggcg ctggtaaatt cgcaactgat cctgcggtta ccctggctca cgaactgatt 780 

catgcaggcc accgcctgta cggtatcgcc atcaatccga accgtgtctt caaagttaac 840 

accaacgcgt attacgagat gtccggtctg gaagttagct tcgaagaact gcgtactttt 900 

ggcggtcacg acgctaaatt catcgactct ctgcaagaaa acgagttccg tctgtactac 960 

tataacaagt tcaaagatat cgcatccacc ctgaacaaag cgaaatccat cgtgggtacc 1020 

actgcttctc tccagtacat gaagaacgtt tttaaagaaa aatacctgct cagcgaagac 1080 

acctccggca aattctctgt agacaagttg aaattcgata aactttacaa aatgctgact 1140 

gaaatttaca ccgaagacaa cttcgttaag ttctttaaag ttctgaaccg caaaacctat 1200 

ctgaacttcg acaaggcagt attcaaaatc aacatcgtgc cgaaagttaa ctacactatc 1260 

tacgatggtt tcaacctgcg taacaccaac ctggctgcta attttaacgg ccagaacacg 1320 

gaaatcaaca acatgaactt .cacaaaactg aaaaacttca ctggtctgtt cgagttttac 1380 

aagctgctgt gcgtcgacgg catcattacc tccaaaacta aatctctgat agaaggtaga 1440 

aacaaagcgc tgaacgacct ctgtatcaag gttaacaact gggatttatt cttcagcccg 1500 

agtgaagaca acttcaccaa cgacctgaac aaaggtgaag aaatcacctc agatactaac 1560 

atcgaagcag ccgaagaaaa catctcgctg gacctgatcc agcagtacta cctgaccttt 1620 

aatttcgaca acgagccgga aaacatttct atcgaaaacc tgagctctga tatcatcggc 1680 

cagctggaac tgatgccgaa catcgaacgt ttcccaaacg gtaaaaagta cgagctggac 1740 

aaatatacca tgttccacta cctgcgcgcg caggaatttg aacacggcaa atcccgtatc 1800 

gcactgacta actccgttaa cgaagctctg ctcaacccgt cccgtgtata caccttcttc 1860 

tctagcgact acgtgaaaaa ggtcaacaaa gcgactgaag ctgcaatgtt cttgggttgg 1920 

gttgaacagc ttgtttatga ttttaccgac gagacgtccg aagtatctac taccgacaaa 1980 

attgcggata tcactatcat catcccgtac atcggtccgg ctctgaacat tggcaacatg 2040 

ctgtacaaag acgacttcgt tggcgcactg atcttctccg gtgcggtgat cctgctggag 2100 

ttcatcccgg aaatcgccat cccggtactg ggcacctttg ctctggtttc ttacattgca 2160 

aacaaggttc tgactgtaca aaccatcgac aacgcgctga gcaaacgtaa cgaaaaatgg 2220 



wo 2006/059105 



17 



PCT/GB20D5/004598 



gatgaagttt acaaatatat cgtgaccaac tggctggcta aggttaatac tcagatcgac 2280 

ctcatccgca aaaaaatgaa agaagcactg gaaaaccagg cggaagctac caaggcaatc 2340 

attaactacc agtacaacca gtacaccgag gaagaaaaaa acaacatcaa cttcaacatc 2400 

gacgatctgt cctctaaact gaacgaatcc atcaacaaag ctatgatcaa catcaacaag 2460 

ttcctgaacc agtgctctgt aagctatctg atgaactcca tgatcccgta cggtgttaaa 2520 

cgtctggagg acttcgatgc gtctctgaaa gacgccctgc tgaaatacat ttacgacaac 2580 

cgtggcactc tgatcggtca ggttgatcgt ctgaaggaca aagtgaacaa taccttatcg 2640 

accgacatcc cttttcagct cagtaaatat gtcgataacc aacgcctttt gtccactcta 2700 

gactag 2706 



<210> 20 
<211> 901 
<212> PRT 

<213> Artificial Secjuence 
<220> 

<223> Synthetic 
<400> 20 

Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Thr Ser Gly 
15 10 15 



Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Pro Arg 
20 25 30 



Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
35 40 45 



Asn Gly Val Asp He Ala Tyr He Lys lie Pro Asn Ala Gly Gin Met 
50 55 60 



Gin Pro Val Lys Ala Phe Lys He His Asn Lys He Trp Val He Pro 
65 70 75 80 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
85 90 95 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
100 105 110 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
. . -.115 ..'. ^ . ^.^ 12.0.- . ■ . .125 
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Phe Glu Arg lie Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
130 135 140 



lie Val Arg Gly lie Pro Phe Trp Gly Gly Ser Thr lie Asp Thr Glu 
145 150 155 160 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
165 170 175 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
180 185 190 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
195 200 205 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
210 215 220 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
225 230 235 240 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
245 250 255 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
260 265 270 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
275 280 285 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
290 295 300 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
305 310 315 320 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
325 330 335 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
340 345 350 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
355 360 365 
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Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu lie Tyr Thr 
370 375 380 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
385 390 395 400 



Leu Asn Phe Asp Lys Ala Val Phe Lys lie Asn lie Val Pro Lys Val 
405 410 415 



Asn Tyr Thr lie Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
420 425 430 



Ala Asn Phe Asn Gly Gin Asn Thr Glu lie Asn Asn Met Asn Phe Thr 
435 440 445 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
450 455 460 



Val Asp Gly lie lie Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
465 470 475 480 



Asn Lys Ala Leu Asn Asp Leu Cys He Lys Val Asn Asn Trp Asp Leu 
485 490 495 



Phe Phe Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys. Gly 
500 505 510 



Glu Glu He Thr Ser Asp Thr Asn He Glu Ala Ala Glu Glu Asn He 
515 520 525 



Ser Leu Asp Leu He Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn 
530 535 540 



Glu Pro Glu Asn He Ser He Glu Asn Leu Ser Ser Asp He He Gly 
545 550 555 560 



Gin Leu Glu Leu Met Pro Asn He Glu Arg Phe Pro Asn Gly Lys Lys 
565 570 575 



Tyr Glu Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin Glu 
580 585 590 



Phe Glu His Gly Lys Ser Arg He Ala Leu Thr Asn Ser Val Asn Glu 
595 600 605 



Ala Leu Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr 
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610 



615 



620 



Val Lys Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly Trp 
625 630 635 640 



Val Glu Gin Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser 
645 650 655 



Thr Thr Asp Lys He Ala Asp He Thr He He He Pro Tyr He Gly 
660 665 670 



Pro Ala Leu Asn He Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly 
675 680 685 



Ala Leu He Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro Glu 
690 695 700 



He Ala He Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He Ala 
705 710 715 720 



Asn Lys Val Leu Thr Val Gin J'hr He Asp Asn Ala Leu Ser Lys Arg 
725 730 735 



Asn Glu Lys Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp Leu 
740 745 750 



Ala Lys Val Asn Thr Gin He Asp Leu He Arg Lys Lys Met Lys Glu 
755 760 765 



Ala Leu Glu Asn Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr Gin 
770 775 780 



Tyr Asn Gin Tyr Thr Glu Glu Glu Lys Asn Asn He Asn Phe Asn He 
785 790 795 800 



Asp Asp Leu Ser Ser Lys Leu Asn Glu Ser He Asn Lys Ala Met He 
805 810 815 



Asn He Asn Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met Asn 
820 825 830 



Ser Met He Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala Ser 
835 840 845 



Leu Lys Asp Ala Leu Leu Lys Tyr He T^'X Asp Asn Arg Gly Thr Leu 
850 855 860 



wo 2006/059105 



21 



PCT/GB2005/004598 



lie Gly Gin Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu Ser 
865 870 875 880 

Thr Asp lie Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu 
885 890 895 

Leu Ser Thr Leu Asp 
900 

<210> 21 
<211> 2691 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 21 

atggagttcg ttaacaaaca gttcaactat aaagacccag ttaacggtgt tgacattgct 60 

tacatcaaaa tcccgaacgc tggccagatg cagccggtaa aggcattcaa aatccacaac 120 

aaaatctggg ttatcccgga acgtgatacc tttactaacc cggaagaagg tgacctgaac 180 

ccgccaccgg aagcgaaaca ggtgccggta tcttactatg actccaccta cctgtctacc 240 

gataacgaaa aggacaacta cctgaaaggt gttactaaac tgttcgagcg tatttactcc 300 

accgacctgg gccgtatgct gctgactagc atcgttcgcg gtatcccgtt ctggggcggt 360 

tctaccatcg ataccgaact gaaagtaatc gacactaact gcatcaacgt tattcagccg 420 

gacggttcct atcgttccga agaactgaac ctggtgatca tcggcccgtc tgctgatatc 480 

atccagttcg agtgtaagag ctttggtcac gaagttctga acctcacccg taacggctac 540 

ggttccactc agtacatccg tttctctccg gacttcacct tcggttttga agaatccctg 600 

gaagtagaca cgaacccact gctgggcgct ggtaaattcg caactgatcc tgcggttacc 660 

ctggctcacg aactgattca tgcaggccac cgcctgtacg gtatcgccat caatccgaac 720 

cgtgtcttca aagttaacac caacgcgtat tacgagatgt ccggtctgga agttagcttc 780 

gaagaactgc gtacttttgg cggtcacgac gctaaattca tcgactctct gcaagaaaac 840 

gagttccgtc tgtactacta taacaagttc aaagatatcg catccaccct gaacaaagcg 900 

aaatccatcg tgggtaccac tgcttctctc cagtacatga agaacgtttt taaagaaaaa 960 

tacctgctca gcgaagacac ctccggcaaa ttctctgtag acaagttgaa attcgataaa 1020 

ctttacaaaa tgctgactga aatttacacc gaagacaact tcgttaagtt ctttaaagtt 1080 

ctgaaccgca aa^cctatct .gaacttcgac:i.aaggcagteatcitcaaaatcaa catcgtgccg 1140 
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aaagttaact acactatcta cgatggtttc aacctgcgta acaccaacct ggctgctaat 1200 

tttaacggcc agaacacgga aatcaacaac atgaacttca caaaactgaa aaacttcact 1260 

ggtctgttcg agttttacaa gctgctgtgc gtcgacggca tcattacctc caaaactaaa 1320 

tctctgatag aaggtagaaa caaagcgctg aacgacctct gtatcaaggt taacaactgg 1380 

gatttattct tcagcccgag tgaagacaac ttcaccaacg acctgaacaa aggtgaagaa 1440 

atcacctcag atactaacat cgaagcagcc gaagaaaaca tctcgctgga cctgatccag 1500 

cagtactacc tgacctttaa tttcgacaac gagccggaaa acatttctat cgaaaacctg 1560 

agctctgata tcatcggcca gctggaactg atgccgaaca tcgaacgttt cccaaacggt 1620 

aaaaagtacg agctggacaa atataccatg ttccactacc tgcgcgcgca ggaatttgaa 1680 

cacggcaaat cccgtatcgc actgactaac tccgttaacg aagctctgct caacccgtcc 1740 

cgtgtataca ccttcttctc tagcgactac gtgaaaaagg tcaacaaagc gactgaagct 1800 

gcaatgttct tgggttgggt tgaacagctt gtttatgatt ttaccgacga gacgtccgaa 1860 

gtatctacta ccgacaaaat tgcggatatc actatcatca tcccgtacat cggtccggct 1920 

ctgaacattg gcaacatgct gtacaaagac gacttcgttg gcgcactgat cttctccggt 1980 

gcggtgatcc tgctggagtt catcccggaa atcgccatcc cggtactggg cacctttgct 2040 

ctggtttctt acattgcaaa caaggttctg acfcgtacaaa ccatcgacaa cgcgctgagc 2100 

aaacgtaacg aaaaatggga tgaagtttac aaatatatcg tgaccaactg gctggctaag 2160 

gttaatactc agatcgacct catccgcaaa aaaatgaaag aagcactgga aaaccaggcg 2220 

gaagctacca aggcaatcat taactaccag tacaaccagt acaccgagga agaaaaaaac 2280 

aacatcaact tcaacatcga cgatctgtcc tctaaactga acgaatccat caacaaagct 2340 

atgatcaaca tcaacaagtt cctgaaccag tgctctgtaa gctatctgat gaactccatg 2400 

atcccgtacg gtgttaaacg tctggaggac ttcgatgcgt ctctgaaaga cgccctgctg 2460 

aaatacattt acgacaaccg tggcactctg atcggtcagg ttgatcgtct gaaggacaaa 2520 

gtgaacaata ccttatcgac cgacatccct tttcagctca gtaaatatgt cgataaccaa 2580 

cgccttttgt ccactggcgg tgggggtagt ggcggtggcg gttcgggcgg gggtgggagc 2640 

tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaat tagctaacca g 2691 

<210> 22 
<211> 897 
<212> PRT 

<213> Artificial Sequence 
<220> 
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<400> 22 

Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val Asn Gly 
15 10 15 



Val Asp lie Ala Tyr lie Lys lie Pro Asn Ala Gly Gin Met Gin Pro 
20 25 30 



Val Lys Ala Phe Lys lie His Asn Lys lie Trp Val lie Pro Glu Arg 
35 40 45 



Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro Pro Glu 
SO 55 60 



Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu Ser Thr 
65 70 75 80 



Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu Phe Glu 
85 90 95 



Arg He Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser He Val 
100 105 110 



Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu Leu Lys 
115 120 125 



Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly Ser Tyr 
130 135 140 



Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala Asp He 
145 150 155 160 



He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn Leu Thr 
165 170 175 



Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro Asp Phe 
180 185 190 



Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro Leu Leu 
195 200 205 



Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala His Glu 
210 215 220 



Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn Pro Asn 

. 2a5iL .,:'.x -t..*^ 230.*- .... 235 240 
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Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser Gly Leu 
245 250 255 



Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp Ala Lys 
260 265 270 



Phe lie Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr Tyr Asn 
275 280 285 



Lys Phe Lys Asp lie Ala Ser Thr Leu Asn Lys Ala Lys Ser He Val 
290 295 300 



Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys Glu Lys 
305 310 315 320 



Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp Lys Leu 
325 330 335 



Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu He Tyr Thr Glu Asp 
340 345 350 



Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr Leu Asn 
355 360 365 



Phe Asp Lys Ala Val Phe Lys He Asn He Val Pro Lys Val Asn Tyr 
370 375 380 



Thr He Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala Ala Asn 
385 390 395 400 



Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr Lys Leu 
405 410 415 



Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys Val Asp 
420 425 430 



Gly He He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg Asn Lys 
435 ' 440 445 



Ala Leu Asn Asp Leu Cys He Lys Val Asn Asn Trp Asp Leu Phe Phe 
450 455 460 



Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu 
465 470 475 480 
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He Thr Ser Asp Thr Asn He Glu Ala Ala Glu Glu Asn He Ser Leu 
485 490 495 



Asp Leu He Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro 
500 505 510 



Glu Asn He Ser He Glu Asn Leu Ser Ser Asp He He Gly Gin Leu 
.515 520 525 



Glu Leu Met Pro Asn He Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu 
530 535 540 



Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu 
545 550 555 560 



His Gly Lys Ser Arg He Ala Leu Thr Asn Ser Val Asn Glu Ala Leu 
565 570 575 



Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys 
580 585 590 



Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu 
595 600 605 



Gin Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr 
610 615 620 



Asp Lys He Ala Asp He Thr He He He Pro Tyr He Gly Pro Ala 
625 630 635 640 



Leu Asn He Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu 
645 650 655 



He Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro Glu He Ala 
660 665 670 



He. Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He Ala Asn Lys 
675 680 685 



Val Leu Thr Val Gin- Thr He Asp Asn Ala Leu Ser Lys Arg Asn Glu 
690 695 700 



Lys Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp Leu Ala Lys 
705 710 715 720 



Val Asn Thr Gin He Asp Leu He Arg Lys Lys Met Lys Glu Ala Leu 
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725 730 735 



Glu Asn Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr Gin Tyr Asn 
740 745 750 



Gin Tyr Thr Glu Glu Glu Lys Asn Asn He Asn Phe Asn He Asp Asp 
755 760 765 



Leu Ser Ser Lys Leu Asn Glu Ser He Asn Lys Ala Met He Asn He 
770 775 780 



Asn Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met Asn Ser Met 
785 790 795 800 



He Pro Tyr Gly Val Lys Arg Leu Glu Tlsp Phe Asp Ala Ser Leu Lys 
805 810 815 



Asp Ala Leu Leu Lys Tyr He Tyr Asp Asn Arg Gly Thr Leu He Gly 
820 825 830 



Gin Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp 
835 840 845 



He Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu Leu Ser 
850 855 860 



Thr Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
865 870 875 880 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Leu Ala Asn 
885 890 895 



Gin 



<210> 23 
<211> 2676 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 23 

atgccaataa caattaacaa ctttaattat tcagatcctg ttgataataa aaatatttta 60 
tatttagata ctcatttaaa tacactagct aatgagcctg aaaaagcct't tcgcattaca 120 
ggaaatatat gggtaatacc tgatagattt tcaagaaatt ctaatccaaa tttaaataaa 180 
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cctcctcgag ttacaagccc taaaagtggt tattatgatc ctaattattt gagtactgat 240 

tctgacaaag atacattttt aaaagaaatt ataaagttat ttaaaagaat taattctaga 300 

gaaataggag aagaattaat atatagactt tcgacagata taccctttcc tgggaataac 360 

aatactccaa ttaatacctt tgattttgat gtagatttta acagtgttga tgttaaaact 420 

agacaaggta acaactgggt taaaactggt agcataaatc ctagtgttat aataactgga 480 

cctagagaaa acattataga tccagaaact tctacgttta aattaactaa caataccttt 540 

gcggcacaag aaggatttgg tgctttatca ataatttcaa tatcacctag at tt at get a 600 

acatatagta atgcaactaa tgatgtagga gagggtagat tttctaagtc tgaattttgc 660 

atggatccaa tactaatttt aatgcatgaa cttaatcatg caatgcataa tttatatgga 720 

atagctatac caaatgatca aacaatttca tctgtaacta gtaatatttt ttattctcaa 780 

tataatgtga aattagagta tgcagaaata tatgcatttg gaggtccaac tatagacctt 840 

attcctaaaa gtgcaaggaa atattttgag gaaaaggcat tggattatta tagatctata 900 

gctaaaagac ttaatagtat aactactgca aatccttcaa gctttaataa atatataggg 960 

gaatataaac agaaacttat tagaaagtat agattcgtag tagaatcttc aggtgaagtt 1020 

acagtaaatc gtaataagtt tgttgagtta tataatgaac ttacacaaat atttacagaa 1080 

tttaactacg ctaaaatata taatgtacaa aataggaaaa tatatctttc aaatgtatat 1140 

actccggtta cggcgaatat attagacgat aatgtttatg atatacaaaa tggatttaat 1200 

atacctaaaa gtaatttaaa tgtactattt atgggtcaaa atttatctcg aaatccagca 1260 

ttaagaaaag tcaatcctga aaatatgctt tatttattta caaaattttg tcataaagca 1320 

atagatggta gatcattata taataaaaca ttagattgta gagagctttt agttaaaaat 1380 

actgacttac cctttatagg tgatattagt gatgttaaaa ctgatatatt tttaagaaaa 1440 

gatattaatg aagaaactga agttatatac tatccggaca atgtttcagt agatcaagtt 1500 

attctcagta agaatacctc agaacatgga caactagatt tattataccc tagtattgac 1560 

agtgagagtg aaatattacc aggggagaat caagtctttt atgataatag aactcaaaat 1620 

gttgattatt tgaattctta ttattaccta gaatctcaaa aactaagtga taatgttgaa 1680 

gattttactt ttacgagatc aattgaggag gctttggata atagtgcaaa agtatatact 1740 

tactttccta cactagctaa taaagtaaat gcgggtgttc aaggtggttt atttttaatg 1800 

tgggcaaatg atgtagttga agattttact acaaatattc taagaaaaga tacattagat 1860 

aaaatatcag atgtatcagc tattattccc tatataggac ccgcattaaa tataagtaat 1920 

tctgtaagaa gaggaaattt tactgaagca tttgcagtta ctggtgtaac tattttatta 1980 
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gaagcatttc 


ctgaatttac 


aatacctgca 


cttggtgcat 


ttgtgattta tagtaaggtt 


2040 


caagaaagaa 


acgagattat 


taaaactata 


gataattgtt 


tagaacaaag gattaagaga 


2100 


tggaaagatt 


catatgaatg 


gatgatggga 


acgtggttat 


ccaggattat tactcaattt 


2160 


aataatataa 


gttatcaaat 


gtatgattct 


ttaaattatc 


aggcaggtgc aatcaaagct 


2220 


aaaatagatt 


tagaatataa 


aaaatattca 


ggaagtgata 


aagaaaatat aaaaagtcaa 


2280 


gttgaaaatt 


taaaaaatag 


tttagatgta 


aaaatttcgg 


aagcaatgaa taatataaat 


2340 


aaatttatac 


gagaatgttc 


cgtaacatat 


ttatttaaaa 


atatgttacc taaagtaatt 


2400 


gatgaattaa 


atgagtttga 


tcgaaatact 


aaagcaaaat 


taattaatct tatagatagt 


2460 


cataatatta 


ttctagttgg 


tgaagtagat 


aaa'ttiaaaag 


caaaagtaaa taatagcttt 


2520 


caaaatacaa 


taccctttaa 


tattttttca 


tatactaata 


attctttatt aaaagatata 


2580 


attaatgaat 


atttcaatgg 


cggtgggggt 


agtggcggtg 


gcggttcggg cgggggtggg 


2640 


agctttggcg 


gtttcacggg 


cgcacgcaaa 


tcagcg 




2676 



<210> 24 
<211> 892 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 24 

Met Pro He Thr He Asn Asn Phe Asn Tyr Ser Asp Pro Val Asp Asn 
15 10 15 



Lys Asn He Leu Tyr Leu Asp Thr His Leu Asn Thr Leu Ala Asn Glu 
20 25 30 



Pro Glu Lys Ala Phe Arg He Thr Gly Asn He Trp Val He Pro Asp 
35 40 45 



Arg Phe Ser Arg Asn Ser Asn Pro Asn Leu Asn Lys Pro Pro Arg Val 
. 50 55 60 



Thr Ser Pro Lys Ser Gly Tyr Tyr Asp Pro Asn Tyr Leu Ser Thr Asp 
65 70 75 80 



Ser Asp Lys Asp Thr Phe Leu Lys Glu He He Lys Leu Phe Lys Arg 
85 90 95 



.^•leiAen Ser Arg Glu J.le>GlyV Gl\x Glu Leu He Tyr Arg Leu Ser Thr 
100 105 110 
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Asp He Pro Phe Pro Gly Asn Asn Asn Thr Pro He Asn Thr Phe Asp 
115 120 125 



Phe Asp Val Asp Phe Asn Ser Val Asp Val Lys Thr Arg Gin Gly Asn 
130 135 140 



Asn Trp Val Lys Thr Gly Ser He Asn Pro Ser Val He He Thr Gly 
145 150 155 160 



Pro Arg Glu Asn He He Asp Pro Glu Thr Ser Thr Phe Lys Leu Thr 
165 170 175 



Asn Asn Thr Phe Ala Ala Gin Glu Gly Phe Gly Ala Leu Ser He He 
180 185 190 



Ser He Ser Pro Arg Phe Met Leu Thr Tyr Ser Asn Ala Thr Asn Asp 
195 200 205 



Val Gly Glu Gly Arg Phe Ser Lys Ser Glu Phe Cys Met Asp Pro He 
210 215 220 



Leu He Leu Met His Glu Leu Asn His Ala Met His Asn Leu Tyr Gly 
225 230 235 240 



He Ala He Pro Asn Asp Gin Thr He Ser Ser Val Thr Ser Asn He 
245 250 255 



Phe Tyr Ser Gin Tyr Asn Val Lys Leu Glu Tyr Ala Glu He Tyr Ala 
260 265 270 



Phe Gly Gly Pro Thr He Asp Leu He Pro Lys Ser Ala Arg Lys Tyr 
275 280 285 



Phe Glu Glu Lys Ala Leu Asp Tyr Tyr Arg Ser He Ala Lys Arg Leu 
290 295 300 



Asn Ser He Thr Thr Ala Asn Pro Ser Ser Phe Asn Lys Tyr He Gly 
305 310 315 320 



Glu Tyr Lys Gin Lys Leu He Arg Lys Tyr Arg Phe Val Val Glu Ser 
325 330. 335 



Ser Gly Glu Val Thr Val Asn Arg Asn Lys Phe Val Glu Leu Tyr Asn 
tL . ir.- 3*40-" tt i^. . .'^345 ^ . 350 
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Glu Leu Thr Gin He Phe Thr Glu Phe Asn Tyr Ala Lys He Tyr Asn 
355 360 365 



Val Gin Asn Arg Lys He Tyr Leu Ser Asn Val Tyr Thr Pro Val Thr 
370 375 380 



Ala Asn He Leu Asp Asp Asn Val Tyr Asp He Gin Asn Gly Phe Asn 
385 390 395 400 



He Pro Lys Ser Asn Leu Asn Val Leu Phe Met Gly Gin Asn Leu Ser 
405 410 415 



Arg Asn Pro Ala Leu Arg Lys Val Asn Pro Glu Asn Met Leu Tyr Leu 
420 425 430 



Phe Thr Lys Phe Cys His Lys Ala He Asp Gly Arg Ser Leu Tyr Asn 
435 440 445 



Lys Thr Leu Asp Cys Arg Glu Leu Leu Val Lys Asn Thr Asp Leu Pro 
450 455 460 



Phe He Gly Asp He Ser Asp Val Lys Thr Asp He Phe* Leu Arg Lys 
465 470 475 480 



Asp He Asn Glu Glu Thr Glu Val He Tyr Tyr Pro Asp Asn Val Ser 
485 490 495 



Val Asp Gin Val He Leu Ser Lys Asn Thr Ser Glu His Gly Gin Leu 
500 505 510 



Asp Leu Leu Tyr Pro. Ser He Asp Ser Glu Ser Glu He Leu Pro Gly 
515 520 525 



dlu Asn Gin Val Phe Tyr Asp Asn Arg Thr Gin Asn Val Asp Tyr Leu 
530 535 540 



Asn Ser Tyr Tyr Tyr Leu Glu Ser Gin Lys Leu Ser Asp Asn Val Glu 
545 550 555 560 



Asp Phe Thr Phe Thr Arg Ser He Glu Glu Ala Leu Asp Asn Ser Ala 
565 570 575 



Lys Val Tyr Thr Tyr Phe Pro Thr Leu Ala Asn Lys Val Asn Ala Gly 
580 585 590 
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Val Gin Gly Gly Leu Phe Leu Met Trp Ala Asn Asp Val Val Glu Asp 
595 600 605 



Phe Thr Thr Asn He Leu Arg Lys Asp Thr Leu Asp Lys He Ser Asp 
610 615 620 



Val Ser Ala He He Pro Tyr He Gly Pro Ala Leu Asn He Ser Asn 
625 630 635 640 



Ser Val Arg Arg Gly Asn Phe Thr Glu Ala Phe Ala Val Thr Gly Val 
645 650 655 



Thr He Leu Leu Glu Ala Phe Pro Glu Phe Thr He Pro Ala Leu Gly 
660 665 670 



Ala Phe Val He Tyr Ser Lys Val Gin Glu Arg Asn Glu He He Lys 
675 680 685 



Thr He Asp Asn Cys Glu Gin Arg He Lys Arg Trp Lys Asp Ser 
690 695 700 



Tyr Glu Trp Met Met Gly Thr Trp Leu Ser Arg He He Thr Gin Phe 
705 710 715 720 



Asn Asn He Ser Tyr Gin Met Tyr Asp Ser Leu Asn Tyr Gin Ala Gly 
725 730 735 



Ala He Lys Ala Lys He Asp Leu Glu Tyr Lys Lys Tyr Ser Gly Ser 
740 745 750 



Asp Lys Glu Asn He Lys Ser Gin Val Glu Asn Leu Lys Asn Ser Leu 
755 760 765 



Asp Val Lys He Ser Glu Ala Met Asn Asn He Asn Lys Phe He Arg 
770 775 780 



Glu Cys Ser Val Thr Tyr Leu Phe Lys Asn Met Leu Pro Lys Val He 
785 790 795 800 



Asp Glu Leu Asn Glu Phe Asp Arg Asn Thr Lys Ala Lys Leu He Asn 
805 810 815 



Leu He Asp Ser His Asn He He Leu Val Gly Glu Val Asp Lys Leu 
820 825 830 

... ' ... ... ^ -« 

Lys Ala Lys Val Asn Asn Ser Phe Gin Asn Thr He Pro Phe Asn He 
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835 840 845 

Phe Ser Tyr Thr Asn Asn Ser Leu Leu Lys Asp lie lie Asn Glu Tyr 
850 855 860 

Phe TVsn Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly 
865 870 875 880 



Ser Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala 





885 




890 






<210> 25 
<211> 2712 
<212> DNA 

<213> Artificial Sequence 








<220> 

<223> Synthetic 










<400> 25 
tttggcggtt 


tcacgggcgc 


acgcaaatca 


gcgcgtaaat atgctaacca 


gactagtggc 


60 


ggtgggggta 


gtggcggtgg 


cggttcgggc 


gggggtggga gccctagggg 


atccatgcca 


120 


ataacaatta 


acaactttaa 


ttattcagat 


cctgttgata ataaaaatat 


tttatattta 


180 


gatactcatt 


taaatacact 


agctaatgag 


cctgaaaaag cctttcgcat 


tacaggaaat 


240 


atatgggtaa 


tacctgatag 


attttcaaga 


aattctaatc caaatttaaa 


taaacctcct 


300 


cgagttacaa 


gccctaaaag 


tggttattat 


gatcctaatt atttgagtac 


tgattctgac 


360 


aaagatacat 


ttttaaaaga 


aattataaag 


ttatttaaaa gaattaattc 


tagagaaata 


420 


ggagaagaat 


taatatatag 


actttcgaca 


gatataccct ttcctgggaa 


taacaatact 


480 


ccaattaata 


cctttgattt 


tgatgtagat 


tttaacagtg ttgatgttaa 


aactagacaa 


540 


ggtaacaact 


gggttaaaac 


tggtagcata 


aatcctagtg ttataataac 


tggacctaga 


600 


gaaaacatta 


tagatccaga 


aacttctacg 


tttaaattaa ctaacaatac 


ctttgcggca 


660 


caagaaggat 


ttggtgcttt 


atcaataatt 


tcaatatcac ctagatttat 


gctaacatat 


720 


agtaatgcaa 


ctaatgatgt 


aggagagggt 


agattttcta agtctgaatt 


ttgcatggat 


780 


ccaatactaa 


ttttaatgca 


tgaacttaat 


catgcaatgc ataatttata 


tggaatagct 


840 


ataccaaatg 


atcaaacaat 


ttcatctgta 


actagtaata ttttttattc 


tcaatataat 


900 


gtgaaattag 


agtatgcaga 


aatatatgca 


tttggaggtc caactataga 


ccttattcct 


960 


aaaagtgcaa 


ggaaatattt 


tgaggaaaag 


gcattggatt attatagatc 


tatagctaaa 


1020 


agacttaata 


gtataactac 


tgcaaatcct 


tcaagcttta ataaatatat 


aggggaatat 


1080 


aaacagaaac 


ttattagaaa 


gtatagattc 


gtagtagaat cttcaggtga 


agttacagta 


1140 
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aatcgtaata 


agtttgttga 


gttatataat 


gaacttacac 


aaatatttac 


agaatttaac 


1200 


tacgctaaaa 


tatataatgt 


acaaaatagg 


aaaatatatc 


tttcaaatgt 


atatactccg 


1260 


gttacggcga 


atatattaga 


cgataatgtt 


tatgatatac 


aaaatggatt 


taatatacct 


1320 


aaaagtaatt 


taaatgtact 


atttatgggt 


caaaatttat 


ctcgaaatcc 


agcattaaga 


1380 


aaagtcaatc 


ctgaaaatat 


gctttattta 


tttacaaaat 


tttgtcataa 


agcaatagat 


1440 


ggtagatcat 


tatataataa 


aacattagat 


tgtagagagc 


ttttagttaa 


aaatactgac 


1500 


ttacccttta 


taggtgatat 


tagtgatgtt 


aaaactgata 


tatttttaag 


aaaagatatt 


1560 


aatgaagaaa 


ctgaagttat 


atactatccg 


gacaatgttt 


cagtagatca 


agttattctc 


1620 


agtaagaata 


cctcagaaca 


tggacaacta 


gatttattat 


accctagtat 


tgacagtgag 


1680 


agtgaaatat 


taccagggga 


gaatcaagtc 


ttttatgata 


atagaactca 


aaatgttgat 


1740 


tatttgaatt 


cttattatta 


cctagaatct 


caaaaactaa 


gtgataatgt 


tgaagatttt 


1800 


acttttacga 


gatcaattga 


ggaggctttg 


gataatagtg 


caaaagtata 


tacttacttt 


1860 


cctacactag 


ctaataaagt 


aaatgcgggt 


gttcaaggtg 


gtttattttt 


aatgtgggca 


1920 


aatgatgtag 


ttgaagattt 


tactacaaat 


attctaagaa 


aagatacatt 


agataaaata 


1980 


tcagatgtat 


cagctattat 


tccctatata 


ggacccgcat 


taaatataag 


taattctgta 


2040 


agaagaggaa 


attttactga 


agcatttgca 


gttactggtg 


taactatttt 


attagaagca 


2100 


tttcctgaat 


ttacaatacc 


tgcacttggt 


gcatttgtga 


tttatagtaa 


ggttcaagaa 


2160 


agaaacgaga 


ttattaaaac 


tatagataat 


tgtttagaac 


aaaggattaa 


gagatggaaa 


2220 


gattcatatg 


aatggatgat 


gggaacgtgg 


ttatccagga 


ttattactca 


atttaataat 


2280 


ataagttatc 


aaatgtatga 


ttctttaaat 


tatcaggcag 


gtgcaatcaa 


agctaaaata 


2340 


gatttagaat 


ataaaaaata 


ttcaggaagt 


gataaagaaa 


atataaaaag 


tcaagttgaa 


2400 


aatttaaaaa 


atagtttaga 


tgtaaaaatt 


tcggaagcaa 


tgaataatat 


aaataaattt 


2460 


atacgagaat 


gttccgtaac 


atatttattt 


aaaaatatgt 


tacctaaagt 


aattgatgaa 


2520 


ttaaatgagt 


ttgatcgaaa 


tactaaagca 


aaattaatta 


atcttataga 


tagtcataat 


2580 


attattctag 


ttggtgaagt 


agataaatta 


aaagcaaaag 


taaataatag 


ctttcaaaat 


2640 


acaataccct 


ttaatatttt 


ttcatatact 


aataattctt 


tattaaaaga 


tataattaat 


2700 


gaatatttca 


at 










2712 



<210> 26 

<211> 904 

<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Synthetic 
<400> 26 

Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Tyr Ala Asn 
15 10 15 



Gin Thr Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 
20 25 30 



Gly Ser Pro Arg Gly Ser Met Pro lie Thr lie Asn Asn Phe Asn Tyr 
35 40 45 



Ser Asp Pro Val Asp Asn Lys Asn lie Leu Tyr Leu Asp Thr His Leu 
50 55 60 



Asn Thr Leu Ala Asn Glu Pro Glu Lys Ala Phe Arg lie Thr Gly Asn 
65 70 75 80 



He Trp Val He Pro Asp Arg Phe Ser Arg Asn Ser Asn Pro Asn Leu 
85 90 95 



Asn Lys Pro Pro Arg Val Thr Ser Pro Lys Ser Gly Tyr Tyr Asp Pro 
100 105 110 



Asn Tyr Leu Ser Thr Asp Ser Asp Lys Asp Thr Phe Leu Lys Glu He 
115 120 125 



He Lys Leu Phe Lys Arg He Asn Ser Arg Glu He Gly Glu Glu Leu 
130 135 140 



He Tyr Arg Leu Ser Thr Asp He Pro Phe Pro Gly Asn Asn Asn Thr 
145 150 155 160 



Pro He Asn Thr Phe Asp Phe Asp Val Asp Phe Asn Ser Val Asp Val 
165 170 175 



Lys Thr Arg Gin Gly Asn Asn Trp Val Lys Thr Gly Ser He Asn Pro 
180 185 190 



Ser Val He He Thr Gly Pro Arg Glu Asn He He Asp Pro Glu Thr 
195 200 205 



Ser Thr Phe Lys Leu Thr Asn Asn Thr Phe Ala Ala Gin Glu Gly Phe 
210 215 220 



wo 2006/059105 



35 



PCT/GB2005/004598 



Gly Ala Leu Ser lie He Ser He 
225 230 



Ser Asn Ala Thr Asn Asp Val Gly 
245 



Phe Cys Met Asp Pro He Leu He 
.2.60 



Met His Asn Leu Tyr Gly He Ala 
275 280 



Ser Val Thr Ser Asn He Phe Tyr 
290 295 



Tyr Ala Glu He Tyr Ala Phe Gly 
305 310 



Lys Ser Ala Arg Lys Tyr Phe Glu 
325 



Ser He Ala Lys Arg Leu Asn Ser 
340 



Phe Asn Lys Tyr He Gly Glu Tyr 
355 360 



Arg Phe Val Val Glu Ser Ser Gly 
370 375 



Phe Val Glu Leu Tyr Asn Glu Leu 
385 390 



Tyr Ala 



Val Tyr Thr Pro Val Thr Ala Asn 
420 



He Gin Asn Gly Phe Asn He Pro 
435 440 



Met Gly Gin Asn Leu Ser Arg Asn 
450 455 



Glu Asn Met Leu Tyr Leu Phe Thr 



Ser Pro Arg Phe Met Leu Thr Tyr 
235 240 



Glu Gly Arg Phe Ser Lys Ser Glu 
250 255 



Leu Met His Glu Leu Asn His Ala 
265 . 270 



He Pro Asn Asp Gin Thr He Ser 
285 



Ser Gin Tyr Asn Val Lys Leu Glu 
300 



Gly Pro Thr He Asp Leu He Pro 
315 320 



Glu Lys Ala Leu Asp Tyr Tyr Arg 
330 335 



He Thr Thr Ala Asn Pro Ser Ser 
345 350 



Lys Gin Lys Leu He Arg Lys Tyr 
365 



Glu Val Thr Val Asn Arg Asn Lys 
380 



Thr Gin He Phe Thr Glu Phe Asn 
395 400 



He Leu Asp Asp Asn Val Tyr Asp 
425 430 



Lys Ser Asn Leu Asn Val Leu Phe 
445 ■ 



Pro Ala Leu Arg Lys Val Asn Pro 
460 



Lys Phe Cys His Lys Ala He Asp 



Lys He Tyr Asn Val Gin Asn Arg Lys He Tyr Leu Ser Asn 
405 410 ' 415 
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465 470 475 480 



Gly Arg Ser Leu Tyr Asn Lys Thr Leu Asp Cys Arg Glu Leu Leu Val 
485 490 495 



Lys Asn Thr Asp Leu Pro Phe He Gly Asp He Ser Asp Val Lys Thr 
500 505 510 



Asp He Phe Leu Arg Lys Asp He Asn Glu Glu Thr Glu Val He Tyr 
515 520 525 



Tyr Pro Asp Asn Val Ser Val Asp Gin Val He Leu Ser Lys Asn Thr 
530 535 540 



Ser Glu His Gly Gin Leu Asp Leu Leu Tyr Pro Ser He Asp Ser Glu 
545 550 555 560 



Ser Glu He Leu Pro Gly Glu Asn Gin Val Phe Tyr Asp Asn Arg Thr 
565 570 575 



Gin Asn Val Asp Tyr Leu Asn Ser Tyr Tyr Tyr Leu Glu Ser Gin Lys 
580 585 590 



Leu Ser Asp Asn Val Glu Asp Phe Thr Phe Thr Arg Ser He Glu Glu 
595 600 605 



Ala Leu Asp Asn Ser Ala Lys Val Tyr Thr Tyr Phe Pro Thr Leu Ala 
610 615 620 



Asn Lys Val Asn Ala Gly Val Gin Gly Gly Leu Phe Leu Met Trp Ala 
625 630 635 640 



Asn Asp Val Val Glu Asp Phe Thr Thr Asn He Leu Arg Lys Asp Thr 
645 650 655 



Leu Asp Lys He Ser Asp Val Ser Ala He He Pro Tyr He Gly Pro 
660 665 670 



Ala Leu Asn He Ser Asn Ser Val Arg Arg Gly Asn Phe Thr Glu Ala 
675 ' 680 685 



Phe Ala Val Thr Gly Val Thr He Leu Leu Glu Ala Phe Pro Glu Phe 
690 695 700 



Thr He Pro Ala Leu Gly Ala Phe Val He Tyr Ser Lys Val Gin Glu 
705 710 715 720 
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Arg Asn Glu He He Lys Thr He Asp Asn Cys Leu Glu Gin Arg He 
725 730 735 



Lys Arg Trp Lys Asp Ser Tyr Glu Trp Met Met Gly Thr Trp Leu Ser 
740 745 750 



Arg He He Thr Gin Phe Asn Asn He Ser Tyr Gin Met Tyr Asp Ser 
755 760 765 



Leu Asn Tyr Gin Ala Gly Ala He Lys Ala Lys He Asp Leu Glu Tyr 
770 775 780 



Lys Lys Tyr Ser Gly Ser Asp Lys Glu Asn He Lys Ser Gin Val Glu 
785 790 795 800 



Asn Leu Lys Asn Ser Leu Asp Val Lys He Ser Glu Ala Met Asn Asn 
805 810 815 



He Asn Lys Phe He Arg Glu Cys Ser Val Thr Tyr Leu Phe Lys Asn 
820 825 830 



Met Leu Pro Lys Val He Asp Glu Leu Asn Glu Phe Asp Arg Asn Thr 
835 840 845 



Lys Ala Lys Leu He Asn Leu He Asp Ser His Asn He He Leu Val 
850 855 860 



Gly Glu Val Asp Lys Leu Lys Ala Lys Val Asn Asn Ser Phe Gin Asn 
865 870 875 880 



Thr He Pro Phe Asn He Phe Ser Tyr Thr Asn Asn Ser Leu Leu Lys 
885 890 895 



Asp He He Asn Glu Tyr Phe Asn 
900 



<210> 27 

<211> 1302 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<400> 27 

ggatccatgg agttcgttaa caaac^gttj: -aac-tat^aag accga^ cggtgttgac 60 
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attgcttaca 


tcaaaatccc 


gaacgctggc cagatgcagc 


cggtaaaggc 


attcaaaatc 


120 


cacaacaaaa 


tctgggttat 


cccggaacgt gataccttta 


ctaacccgga 


agaaggtgac 


180 


ctgaacccgc 


caccggaagc 


gaaacaggtg ccggtatctt 


actatgactc 


cacctacctg 


240 


tctaccgata 


acgaaaagga 


caactacctg aaaggtgtta 


ctaaactgtt 


cgagcgtatt 


300 


tactccaccg 


acctgggccg 


tatgctgctg actagcatcg 


ttcgcggtat 


cccgttctgg 


360 


Sgcggttcta 


ccatcgatac 


cgaactgaaa gtaatcgaca 


ctaactgcat 


caacgttatt 


420 


cagccggacg 


gttcctatcg 


ttccgaagaa ctgaacctgg 


tgatcatcgg 


cccgtctgct 


480 


gatatcatcc 


agttcgagtg 


taagagcttt ggtcacgaag 


ttctgaacct 


cacccgtaac 


540 


ggctacggtt 


ccactcagta 


catccgtttc tctccggact 


tcaccttcgg 


ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


cccactgctg ggcgctggta 


aattcgcaac 


tgatcctgcg 


660 


gttaccctgg 


ctcacgaact 


gattcatgca ggccaccgcc 


tgtacggtat 


cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt 


taacaccaac gcgtattacg 


agatgtccgg 


tctggaagtt 


780 


agcttcgaag 


aactgcgtac 


ttttggcggt cacgacgcta 


aattcatcga 


ctctctgcaa 


840 


gaaaacgagt 


tccgtctgta 


ctactataac aagttcaaag 


atatcgcatc 


caccctgaac 


900 


aaagcgaaat 


ccatcgtggg 


taccactgct tctctccagt 


acatgaagaa 


cgtttttaaa 


960 


gaaaaatacc 


tgctcagcga 


agacacctcc ggcaaattct 


ctgtagacaa gttgaaattc 


1020 


gataaacttt 


acaaaatgct 


gactgaaatt tacaccgaag 


acaacttcgt 


taagttcttt 


1080 






ctafcctcra'ac tt'caacaacicr 


cagtattcaa 


aatcaacatc 


1140 


gtgccgaaag 


ttaactacac 


tatctacgat ggtttcaacc 


tgcgtaacac 


caacctggct 


1200 


gctaatttta 


acggccagaa 


cacggaaatc aacaacatga 


acttcacaaa 


actgaaaaac 


1260 


ttcactggtc 


tgttcgagtt 


ttacaagctg ctgtgcgtcg 


ac 




1302 



<210> 28 • 
<211> 1257 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 28 

ctgcagtgta tcaaggttaa caactgggat ttattcttca gcccgagtga agacaacttc 60 
accaacgacc tgaacaaagg tgaagaaatc acctcagata ctaacatcga agcagccgaa 120 
gaaaacatct cgctggacct gatccagcag tactacctga cctttaattt cgacaacgag 180 
ccggaaaaca tttctatcga aaacctgagc tctgatatca tcggccagct ggaactgatg 240 
ccgaacatcg aacgtttccc aaacggtaaa aagtacgagc tggacaaata taccatgttc 300 



wo 2006/059105 



39 



PCT/GB2005/004S98 



cactacctgc 


gcgcgcagga 


atttgaacac 


ggcaaatccc 


gtatcgcact gactaactcc 


360 


gttaacgaag 


ctctgctcaa 


cccgtcccgt 


gtatacacct 


tcttctctag cgactacgtg 


420 


aaaaaggtca 


acaaagcgac 


tgaagctgca 


atgttcttgg 


gttgggttga acagcttgtt 


480 


tatgatttta 


ccgacgagac 


gtccgaagta 


tctactaccg 


acaaaattgc ggatatcact 


540 


atcatcatcc 


cgtacatcgg 


tccggctctg 


aacattggca 


acatgctgta caaagacgac 


600 


ttcgttggcg 


cactgatctt 


ctccggtgcg 


gtgatcctgc 


tggagttcat cccggaaatc 


660 


gccatcccgg 


tactgggcac 


ctttgctctg 


gtttcttaca 


ttgcaaacaa ggttctgact 


720 


gtacaaacca 


tcgacaacgc 


gctgagcaaa 


cgtaacgaaa 


aatgggatga agtttacaaa 


780 


tatatcgtga 


ccaactggct 


ggctaaggtt 


aatactcaga 


tcgacctcat ccgcaaaaaa 


840 


atgaaagaag 


cactggaaaa 


ccaggcggaa 


gctaccaagg 


caatcattaa ctaccagtac 


900 


aaccagtaca 


ccgaggaaga 


aaaaaacaac 


atcaacttca 


acatcgacga tctgtcctct 


960 


aaactgaacg 


aatccatcaa 


caaagctatg 


atcaacatca 


acaagttcct gaaccagtgc 


1020 


tctgtaagct 


atctgatgaa 


ctccatgatc 


ccgtacggtg 


ttaaacgtct ggaggacttc 


1080 


gatgcgtctc 


tgaaagacgc 


cctgctgaaa 


tacatttacg 


acaaccgtgg cactctgatc 


1140 


ggtcaggttg 


atcgtctgaa 


ggacaaagtg 


aacaatacct 


tatcgaccga catccctttt 


1200 


cagctcagta 


aatatgtcga 


taaccaacgc 


cttttgtcca 


ctctagacta gaagctt 


1257 



<210> 29 
<211> 1323 
<212> DNA 

<213> Artificial' Sequence 
<220> 

<223> Synthetic 
<400> 29 

ggatccatgc cggttaccat caacaacttc aactacaacg acccgatcga caacaacaac 60 
atcattatga tggaaccgcc gttcgcacgt ggtaccggac gttactacaa ggcttttaag 120 
atcaccgacc gtatctggat catcccggaa cgttacacct tcggttacaa acctgaggac 180 
ttcaacaaga gtagcgggat tttcaatcgt gacgtctgcg agtactatga tccagattat 240 
ctgaatacca acgataagaa gaacatattc cttcagacta tgattaaact cttcaaccgt 300 
atcaaaagca aaccgctcgg tgaaaaactc ctcgaaatga ttatcaacgg tatcccgtac 360 
ctcggtgacc gtcgtgtccc gcttgaagag ttcaacacca acatcgcaag cgtcaccgtc 420 
aacaaactca tcagcaaccc aggtgaagtc gaacgtaaaa aaggtatctt cgcaaacctc 480 
v.atcatctticg ^tccgggticc ggtectcaac ga ccatcgacat cggtatccag 540 
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aaccacttcg 


caagccgtga 


aggtttcggt ggtatcatgc 


agatgaaatt ctgcccggaa 


600 


tacgtcagtg 


tcttcaacaa 


cgtccaggaa 


aacaaaggtg caagcatctt caaccgtcgt 


660 


ggttacttca 


gcgacccggc 


actcatcctc 


atgcatgaac 


tcatccacgt cctccacggt 


720 


ctctacggta 


tcaaagttga 


cgacctcccg 


atcgtcccga 


acgagaagaa attcttcatg 


780 


cagagcaccg 


acgcaatcca 


ggctgaggaa 


ctctacacct 


tcggtggcca agacccaagt 


840 


atcataaccc 


cgtccaccga 


caaaagcatc 


tacgacaaag 


tcctccagaa cttcaggggt 


900 


atcgtggaca 


gactcaacaa 


agtcctcgtc 


tgcatcagcg 


acccgaacat caatatcaac 


960 


atatacaaga 


acaagttcaa 


agacaagtac 


aaattcgtcg 


aggacagcga aggcaaatac 


1020 


agcatcgacg 


tagaaagttt 


cgacaagctc 


tacaaaagcc 


tcatgttcgg tttcaccgaa 


1080 


accaacatcg 


ccgagaacta 


caagatcaag 


acaagggcaa 


gttacttcag cgacagcctc 


1140 


ccgcctgtca 


aaaticaagaa 


cctcttagac 


aacgagattt 


acacaattga agagggcttc 


1200 


aacatcagtg 


acaaagacat 


ggagaaggaa 


tacagaggtc 


agaacaaggc tatcaacaaa 


1260 


caggcatacg 


aggagatcag 


caaagaacac 


ctcgcagtct 


acaagatcca gatgtgcgtc 


1320 


gac 










1323 



<210> 30 
<211> 1260 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 30 

ctgcagtgca tcgacgttga caacgaagac ctgttcttca tcgctgacaa aaacagcttc 60 
agtgacgacc tgagcaaaaa cgaacgtatc gaatacaaca cccagagcaa ctacatcgaa 120 
aacgacttcc cgatcaacga actgatcctg gacaccgacc tgataagtaa aatcgaactg 180 
ccgagcgaaa acaccgaaag tctgaccgac ttcaacgttg acgttccggt ttacgaaaaa 240 
cagccggcta tcaagaaaat cttcaccgac gaaaacacca tcttccagta cctgtacagc 300 
cagaccttcc cgctggacat ccgtgacatc agtctgacca gcagtttcga cgacgctctg 360 
ctgttcagca acaaagttta cagtttcttc agcatggact acatcaaaac cgctaacaaa 420 
gttgttgaag cagggctgtt cgctggttgg gttaaacaga tcgttaacga cttcgttatc 480 
gaagctaaca aaagcaacac tatggacaaa atcgctgaca tcagtctgat cgttccgtac 540 
atcggtctgg ctctgaacgt tggtaacgaa accgctaaag gtaactttga aaacgctttc 600 
gagatcgctg gtgcaagcat cctgctggag ttcatcccgg aactgctgat cccggttgtt 660 
ggtgctttcc tgctggaaag ttacatcgac aacaaaaaca agatcatcaa aaccatcgac 720 
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aacgctctga ccaaacgtaa cgaaaaatgg agtgatatgt acggtctgat cgttgctcag 780 

tggctgagca ccgtcaacac ccagttctac accatcaaag aaggtatgta caaagctctg 840 

aactaccagg ctcaggctct ggaagagatc atcaaatacc gttacaacat ctacagtgag 900 

aaggaaaaga gtaacatcaa catcgacttc aacgacatca acagcaaact gaacgaaggt 960 

atcaaccagg ctatcgacaa catcaacaac ttcatcaacg gttgcagtgt tagctacctg 1020 

atgaagaaga tgatcccgct ggctgttgaa aaactgctgg acttcgacaa caccctgaaa 1080 

aagaacctgc tgaactacat cgacgaaaac aagctgtacc tgatcggtag tgctgaatac 1140 

gaaaaaagta aagtgaacaa atacctgaag accatcatgc cgttcgacct gagtatctac 1200 

accaacgaca ccatcctgat cgaaatgttc aacaaataca actctctaga ctagaagctt 1260 

<210> 31 

<211> 1329 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Synthetic 

<400> 31 



ggatccgaat 


tcatgccgat 


caccatcaac aacttcaact acagcgatcc ggtggataac 


60 


aaaaacatcc 


tgtacctgga 


tacccatctg aataccctgg cgaacgaacc ggaaaaagcg 


120 


tttcgtatca 


ccggcaacat 


ttgggttatt ccggatcgtt ttagccgtaa cagcaacccg 


180 


aatctgaata 


aaccgccgcg 


tgttaccagc ccgaaaagcg gttattacga tccgaactat 


240 


ctgagcaccg 


atagcgataa 


agataccttc ctgaaagaaa tcatcaaact gttcaaacgc 


300 


atcaacagcc 


gtgaaattgg 


cgaagaactg atctatcgcc tgagcaccga tattccgttt 


360 


ccgggcaaca 


acaacacccc 


gatcaacacc tttgatttcg atgtggattt caacagcgtt 


420 


gatgttaaaa 


cccgccaggg 


taacaattgg gtgaaaaccg gcagcattaa cccgagcgtg 


480 


attattaccg 


gtccgcgcga 


aaacattatt gatccggaaa ccagcacctt taaactgacc 


540 


aacaacacct 


ttgcggcgca 


ggaaggtttt ggcgcgctga gcattattag cattagcccg 


600 


cgctttatgc 


tgacctatag 


caacgcgacc aacgatgttg gtgaaggccg tttcagcaaa 


660 


agcgaatttt 


gcatggaccc 


gatcctgatc ctgatgcatg aactgaacca tgcgatgcat 


720 


aacctgtatg 


gcatcgcgat 


tccgaacgat cagaccatta gcagcgtgac cagcaacatc 


780 


ttttacagcc 


agtacaacgt 


gaaactggaa tatgcggaaa tctatgcgtt tggcggtccg 


840 


accattgatc 


tgattccgaa 


aagcgcgcgc aaatacttcg aagaaaaagc gctggattac 


900 


^ .ts.tcgcagca 


ttgcgaaacg 


tctgaacagc attaccaccg cgaatccgag cagcttcaac 


960 
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aaatatatcg gcgaatataa acagaaactg atccgcaaat atcgctttgt ggtggaaagc 1020 

agcggcgaag ttaccgttaa ccgcaataaa ttcgtggaac tgtacaacga actgacccag 1080 

atcttcaccg aatttaacta tgcgaaaatc tataacgtgc agaaccgtaa aatctacctg 1140 

agcaacgtgt ataccccggt gaccgcgaat attctggatg ataacgtgta cgatatccag 1200 

aacggcttta acatcccgaa aagcaacctg aacgttctgt ttatgggcca gaacctgagc 1260 

cgtaatccgg cgctgcgtaa agtgaacccg gaaaacatgc tgtacctgtt caccaaattt 1320 

tgcgtcgac 1329 

<210> 32 
<211> 1263 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 32 

ctgcagtgtc gtgaactgct ggtgaaaaac accgatctgc cgtttattgg cgatatcagc 60 

gatgtgaaaa ccgatatctt cctgcgcaaa gatatcaacg aagaaaccga agtgatctac 120 

tacccggata acgtgagcgt tgatcaggtg atcctgagca aaaacaccag cgaacatggt 180 

cagctggatc tgctgtatcc gagcattgat agcgaaagcg aaattctgcc gggcgaaaac 240 

caggtgtttt acgataaccg tacccagaac gtggattacc tgaacagcta ttactacctg 300 

gaaagccaga aactgagcga taacgtggaa gattttacct ttacccgcag cattgaagaa 360 

gcgctggata acagcgcgaa agtttacacc tattttccga ccctggcgaa caaagttaat 420 

gcgggtgttc agggcggtct gtttctgatg tgggcgaacg atgtggtgga agatttcacc 480 

accaacatcc tgcgtaaaga taccctggat aaaatcagcg atgttagcgc gattattccg 540 

tatattggtc cggcgctgaa cattagcaat agcgtgcgtc gtggcaattt taccgaagcg 600 

tttgcggtta ccggtgtgac cattctgctg gaagcgtttc cggaatttac cattccggcg 660 

ctgggtgcgt ttgtgatcta tagcaaagtg caggaacgca acgaaatcat caaaaccatc 720 

gataactgcc tggaacagcg tattaaacgc tggaaagata gctatgaatg gatgatgggc 780 

acctggctga gccgtattat cacccagttc aacaacatca gctaccagat gtacgatagc 840 

ctgaactatc aggcgggtgc gattaaagcg aaaatcgatc tggaatacaa aaaatacagc 900 

ggcagcgata aagaaaacat caaaagccag gttgaaaacc tgaaaaacag cctggatgtg 960 

aaaattagcg aagcgatgaa taacatcaac aaattcatcc gcgaatgcag cgtgacctac 1020 

ctgttcaaaa acatgctgcc gaaagtgatc gatgaactga acgaatttga tcgcaacacc 1080 

aaagcgaaac tgatcaacct gatcgatagc cacaacatta ttctggtggg cgaagtggat 1140 
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aaactgaaag cgaaagttaa caacagcttc cagaacacca tcccgtttaa catcttcagc 1200 
tataccaaca acagcctgct gaaagatatc atcaacgaat acttcaatct agactagaag 1260 
ctt 1263 

<210> 33 
<211> 207 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Synthetic 
<400> 33 

ggatccacgc acgtcgacgg catcattacc tccaaaacta aatctctgat cgaaggtcgt 60 
tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaat tagctaacca ggcgctagcg 120 
ggcggtggcg gtagcggcgg tggcggtagc ggcggtggcg gtagcgcact agtgctgcag 180 
acgcacggtc tagaatgata aaagctt 207 



<210> 34 
<211> 108 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 34 

ggatccacgc acgtcgacgg catcattacc tccaaaacta aatctctgat agaaggtaga 60 
aacaaagcgc tgaacctgca gacgcacggt ctagaatgat aaaagctt 108 



<210> 35 
<211> 186 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Synthetic 
<400> 35 

catatgaata acctcgggat tgagggtcgt tttggcggtt tcacgggcgc acgcaaatca 60 
gcgcgtaaat tagctaacca gactagtggc ggtgggggta gtggcggtgg cggttcgggc 120 
9999gtggga gccctagggg atccgtcgac ctgcagggtc tagaagcgct agcgtgataa 180 
aagctt 186 



<210> 36 

„<211> ■ 180 *u-r. . * ii-, "^z:r. -^v-. 

<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 36 

ggatccacgc acgtcgacgc gattgatggt cgttttggcg gtttcacggg cgcacgcaaa 60 
tcagcgcgta aattagctaa ccaggcgcta gcgggcggtg gcggtagcgg cggtggcggt 120 
agcggcggtg gcggtagcgc actagtgctg cagacgcacg gtctagaatg ataaaagctt 180 



<210> 37 
<211> 249 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 37 

ggatccacgc acgtcgacgg catcattacc tccaaaacta aatctctgat cgaaggtcgt 60 
tacggtggtt tcatgacctc tgaaaaatct cagaccccgc tggttaccct gttcaaaaac 120 
gctatcatca aaaacgctta caaaaaaggt gaagcgctag cgggtggtgg tggttctggt 180 
9gtggtggtt ctggtggtgg tggttctgca ctagtgctgc agacgcacgg tctagaatga 240 
taaaagctt 249 



<210> 38 
<211> 207 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 38 

ggatccacgc acgtcgacgg catcattacc tccaaaacta aatctctgat cgaaggtcgt 60 
tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaac gtaagaacca ggcgctagcg 120 
ggcggtggcg gtagcggcgg tggcggtagc ggcggtggcg gtagcgcact agtgctgcag 180 
acgcacggtc tagaatgata aaagctt 207 



<210> 39 

<211> 2709 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



.^<40^>: 39 ' . • - r. ' . - 

ggatccatgg agttcgttaa caaacagttc aactataaag acccagttaa cggtgttgac 60 
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attgcttaca 


tcaaaatccc 


gaacgctggc 


cagatgcagc 


cggtaaaggc 


attcaaaatc 


1 OA 


cacaacaaaa 


tctgggttat 


cccggaacgt 


gataccttta 


ctaacccgga 


agaaggtgac 




ctgaacccgc 


caccggaagc 


gaaacaggtg 


ccggtatctt 


actatgactc 


cacctacctg 


240 


tctaccgata 


acgaaaagga 


caactacctg 


aaaggtgtta 


ctaaactgtt 


cgagcgtatt 


300 


tactccaccg 


acctgggccg 


tatgctgctg 


actagcatcg 


ttcgcggtat 


cccgttctgg 


360 


ggcggttcta 


ccatcgatac 


cgaactgaaa 


gtaatcgaca 


ctaactgcat 


caacgttatt 


420 


cagccggacg 


gttcctatcg 


ttccgaagaa 


ctgaacctgg 


tga teat egg 


cccgtctgct 


480 


gatatcatcc 


agttcgagtg 


taagagcttt 


ggtcacgaag 


ttctgaacct 


cacccgtaac 


540 


ggctacggtt 


ccactcagta 


catccgtttc 


tctccggact 


tcaccttcgg 


ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


cccactgctg 


ggcgctggta 


aattcgcaac 


tgatcctgcg 


660 


gttaccctgg 


ctcacgaact 


gattcatgca 


ggccaccgcc 


tgtacggtat 


cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt 


taacaccaac 


gcgtattacg 


agatgtccgg 


tctggaagtt 


780 


agcttcgaag 


aactgcgtac 


ttttggcggt 


cacgacgcta 


aattcatcga 


ctctctgcaa 


840 


gaaaacgagt 


tccgtctgta 


ctactataac 


aagttcaaag 


atatcgcatc 


caccctgaac 


900 


aaagcgaaat 


ccatcgtggg 


taccactgct 


tctctccagt 


acatgaagaa 


cgtttttaaa 


960 


gaaaaatacc 


tgctcagcga 


agacacctcc 


ggcaaattct 


ctgtagacaa 


gttgaaattc 


1020 


gataaacttt 


acaaaatgct 


gactgaaatt 


tacaccgaag 


acaacttcgt 


taagttcttt 


1080 


aaagttctga 


accgcaaaac 


ctatctgaac 


ttcgacaagg 


cagtattcaa 


aatcaacatc 


1140 


gtgccgaaag 


ttaactacac 


tatctacgat 


ggtttcaacc 


tgcgtaacac 


caacctggct 


1200 


gctaatttta 


acggccagaa 


cacggaaatc 


aacaacatga 


acttcacaaa 


actgaaaaac 


1260 


ttcactggtc 


tgttcgagtt 


ttacaagctg 


ctgtgcgtcg 


acggcatcat 


tacctccaaa 


1320 


actaaatctc 


tgatagaagg 


tagatttggc 


ggtttcacgg 


gcgcacgcaa 


atcagcgcgt 


1380 


aaattagcta 


accaggcgct 


agcgggcggt 


ggcggtagcg 


gcggtggcgg 


tagcggcggt 


1440 


ggcggtagcg 


cactagtgct 


gcagtgtatc 


aaggttaaca 


actgggattt 


attcttcagc 


1500 


ccgagtgaag 


acaacttcac 


caacgacctg 


aacaaaggtg 


aagaaatcac 


ctcagatact 


1560 


aacatcgaag 


cagccgaaga 


aaacatctcg 


ctggacctga 


tccagcagta 


ctacctgacc 


1620 


tttaatttcg 


acaacgagcc 


ggaaaacatt 


tctatcgaaa 


acctgagctc 


tgatatcatc 


1680 


ggccagctgg 


aactgatgcc 


gaacatcgaa 


cgtttcccaa 


acggtaaaaa 


gtacgagctg 


1740 


gacaaatata 


ccatgttcca 


ctacctgcgc 


gcgcaggaat 


ttgaacacgg 


caaatcccgt 


1800 


atcgcactga 


ctaactccgt 


taacgaagct 


ctgctcaacc 


cgtcccgtgt 


atacaccttc * 


1860 
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ttctctagcg 


actacgtgaa 


aaaggtcaac aaagcgactg aagctgcaat gttcttgggt 


1920 


tgggttgaac 


agcttgttta 


tgattttacc gacgagacgt ccgaagtatc tactaccgac 


1980 


aaaattgcgg 


atatcactat 


catcatcccg 


tacatcggtc cggctctgaa cattggcaac 


2040 


atgctgtaca 


aagacgactt 


cgttggcgca 


ctgatcttct ccggtgcggt gatcctgctg 


2100 


gagttcatcc 


cggaaatcgc 


catcccggta 


ctgggcacct ttgctctggt ttcttacatt 


2160 


gcaaacaagg 


ttctgactgt 


acaaaccatc 


gacaacgcgc tgagcaaacg taacgaaaaa 


2220 


tgggatgaag 


tttacaaata 


tatcgtgacc 


aactggctgg ctaaggttaa tactcagatc 


2280 


gacctcatcc 


gcaaaaaaat 


gaaagaagca 


ctggaaaacc aggcggaagc taccaaggca 


2340 


atcattaact 


accagtacaa 


ccagtacacc 


gaggaagaaa aaaacaacat caacttcaac 


2400 


atcgacgatc 


tgtcctctaa 


actgaacgaa 


tccatcaaca aagctatgat caacatcaac 


2460 


aagttcctga 


accagtgctc 


tgtaagctat 


ctgatgaact ccatgatccc gtacggtgtt 


2520 


aaacgtctgg 


aggacttcga 


tgcgtctctg aaagacgccc tgctgaaata catttacgac 


2580 


aaccgtggca 


ctctgatcgg 


tcaggttgat 


cgtctgaagg acaaagtgaa caatacctta 


2640 


tcgaccgaca 


tcccttttca 


gctcagtaaa tatgtcgata accaacgcct tttgtccact 


2700 


ctagactag 








2709 



<210> 40 

<211> 902 
<212> PRT 

<213> Artificial Sequence 
<220> 

< 2 2 3 > Synt he t i c 
<400> 40 

Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
15 10 15 

Asn Gly Val Asp lie Ala Tyr lie Lys lie Pro Asn Ala Gly Gin Met 
20 25 30 

Gin Pro Val Lys Ala Phe Lys lie His Asn Lys lie Trp Val lie Pro 
35 ■ 40 45 

Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 
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Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg He Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



He Val Arg Gly. He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala- Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
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325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu lie Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys lie Asn lie Val Pro Lys Val 
370 375 380 



Asn Tyr Thr He Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly He He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Leu Ala Asn 
450 ^ 455 460 

Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
465 470 475 480 



Gly Gly Ser Ala Leu Val Leu Gin Cys He Lys Val Asn Asn Trp Asp 
485 490 495 



Leu Phe Phe Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys 
500 505 510 



Gly Glu Glu He Thr Ser Asp Thr Asn He Glu Ala Ala Glu Glu Asn 
515 520 525 



He Ser Leu Asp Leu He Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp 
530 535 540 



Asn Glu Pro Glu Asn He Ser He Glu Asn Leu Ser Ser Asp He He 
545 550 555 560 



Gly Gin Leu Glu Leu -^Met Pro Asn He Glu Arg Phe Pro Asn Gly Lys 
565 570 575 
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Lys Tyr Glu Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin 
580 585 590 



Glu Phe Glu His Gly Lys Ser Arg lie Ala Leu Thr Asn Ser Val Asn 
595 600 605 



Glu Ala Leu Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp 
610 615 620 



Tyr Val Lys Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly 
625 630 635 ♦ 640 



Trp Val Glu Gin Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val 
645 650 655 



Ser Thr Thr Asp Lys lie Ala Asp lie Thr lie lie lie Pro Tyr lie 
660 665 670 



Gly Pro Ala Leu Asn He Gly Asn Met Leu Tyr Lys Asp Asp Phe Val 
675 680 685 



Gly Ala Leu He Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro 
690 695 700 



Glu He Ala He Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He 
705 710 715 720 



Ala Asn Lys Val Leu Thr Val Gin Thr He Asp Asn Ala Leu Ser Lys 
725 730 735 



Arg Asn Glu Lys Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp 
740 745 750 



Leu Ala Lys Val Asn Thr Gin He Asp Leu He Arg Lys Lys Met Lys 
755 760 765 



Glu Ala Leu Glu Asn Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr 
770 775 780 



Gin Tyr Asn Gin Tyr Thr Glu Glu Glu Lys Asn Asn He Asn Phe Asn 
785 790 795 800 



He Asp Asp Leii Ser Ser Lys Leu Asn Glu Ser He Asn Lys Ala Met 

, -v ^r-. 3. a05\ ,^vti. ^.,^7-::.. c 810. uV:'.*^ r 815 
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lie Asn He Asn hys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met 
820 825 830 



Asn Ser Met He Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala 
835 840 845 



Ser Leu Lys Asp Ala Leu Leu Lys Tyr He Tyr Asp Asn Arg Gly Thr 
850 855 860 



Leu He Gly Gin Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu 
865 870 875 880 



Ser Thr Asp He Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg 
885 890 895 



Leu Leu Ser Thr Leu Asp 
900 



<210> 41 

<211> 2736 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 41 



ctcgggattg 


agggtcgttt 


tggcggtttc 


acgggcgcac 


gcaaatcagc 


gcgtaaatta 


60 


gctaaccaga 


ctagtggcgg 


tgggggtagt 


ggcggtggcg 


gttcgggcgg 


gggtgggagc 


120 


cctaggggat 


ccatggagtt 


cgttaacaaa 


cagttcaact 


ataaagaccc 


agttaacggt 


180 


gttgacattg 


cttacatcaa 


aatcccgaac 


gctggccaga 


tgcagccggt 


aaaggcattc 


240 


aaaatccaca 


acaaaatctg 


ggttatcccg 


gaacgtgata 


cctttactaa 


cccggaagaa 


300 


ggtgacctga 


acccgccacc 


ggaagcgaaa 


caggtgccgg 


tatcttacta 


tgactccacc 


360 


tacctgtcta 


ccgataacga 


aaaggacaac 


tacctgaaag 


gtgttactaa 


actgttcgag 


420 


cgtatttact 


ccaccgacct 


gggccgtatg 


ctgctgacta 


gcatcgttcg 


cggtatcccg 


480 


ttctggggcg 


gttctaccat 


cgataccgaa 


ctgaaagtaa 


tcgacactaa 


ctgcatcaac 


540 


gttattcagc 


cggacggttc 


ctatcgttcc 


gaagaactga 


acctggtgat 


catcggcccg 


600 


tctgctgata 


tcatccagtt 


cgagtgtaag 


agctttggtc 


acgaagttct 


gaacctcacc 


660 


cgtaacggct 


acggttccac 


tcagtacatc 


cgtttctctc 


cggacttcac 


cttcggtttt 


720 


gaagaatccc 


tggaagtaga 


cacgaaccca 


ctgctgggcg 


ctggtaaatt 


cgcaactgat 


780 


cctgcggtta 


ccctggctca 


cgaactgatt 


catgcaggcc 


accgcctgta 


cggtatcgcc 


840 
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3.tcaEtccgci 


Wwv»y ^y u^ u u 


f^aaacft"^aar* 


awwaaury wy u 


au uauyay ciu 


y uu.u.y y uu. uy 


900 


craacf 1 1 a.ac t 


t caa a era a r** i" 


y \i<»y u u u u 


yy v-yy t-corCy 


a c*nr*^ aa a ♦* ^ 
au^y wuaaa u u 


o a ^ orra r» t* r» t" 
u.cLuv.y ciu. uu» u 


7 O V 




Mwy ay u L>\.#v«y 


uy uaw« uciu* 


^ a t~ a a r«a arrt* 
L.auctciv.«aa.y u 


f" a a a na ^ a t" 
ucaaaycLUctu 


oio»o»a 4- ot/^9 /^r* 


1 n^n 




\»y cicict u w(..*cl u 


wy tyyyuacc 


au.uycuL.ccc 


^ ot a or4~ a o a 4- 

uccay uaCau 


f^a a or 9 a o»#*t4- 4* 

ya.agaacguu 


1 nftn 
±uo u 


u L. c^cicinvjcicici 


a a ucii>.u> uy u 


u- ay u.y day ctu> 


ctu. u. u cu^y y c a 


a a ^ 

aauucuc uy u 


a a a 9 ort" ^ or 

dy dCady u uy 




ClCL CI U U CI UCl 


acLL> u L. uav^ctd 


acii^y w uy ac u 


y aaa u uuaCa 


CCyaayaCda 


ot 4" 4" ^or4* ^ 9 9 or 

cu ucy u uddy 




^uCL.Ul.acLa^ 


♦* +• r*^<*ra s>f*r*/t 
L.uCt.yaaCCy 


^9999^/**4'94' 


^ 4" or a a ^ ^ ^ ^or 

cugaacuucg 


a ^ a a ^/^oi a or ^ 

acaaggcagc 


a 4~ 4" ^ a a a a ^ #^ 

auucaaaauc 






f^f^Si 9 a/*T^^ 9 9 

Cyaaay UX^aa 


^^9^9^4*9^^ 

cuacacuauc 


uacgaugguu 


4" ^ a a ^ <'i 1 ' or ^ or 

ucaaccugcg 


4" a a ot a ot or a a 

uaacaccaac 






auuu uaawyy 


CCayaaCaCy 


or9 a a 4" ^ a s» oi a 

gaaaucaaca 


a a 4* or a a m ^ ^ 

acaugaac u u 


A* a ^ a a a a oi ^ or 

cacaaaacug 


X J o u 


s A ^ ^ ^ ^1 4* 4* ^ ^ 


cuggucuyc u 


cgagtt't'bac 


aagctigctgt 


gcgt.cgacgg 


catcalz't.acc 


X44fcU 


1^ C C a.a.a.a.C ua 


aaccuCuga^ 


s or A a y<»^n4» a A 

agaag g u aga 


— k M ^ ^ -fi ory^orM 

aacaaagcgc 


cgaacgaccu 


c ug uac caag 


1 trnn 


^ !■ S 9 9 a 

UaaCaaCu 


gggauuuacu 


ccucagcccg 


agugaagaca 


acuucaccaa 


cgaccugaac 


1 C£0 


do. ay 2 L. y a o.^ 


aaaUCaCC uC 


9 or a 4" 9 9 9 ^ 

ay auaCuaac 


9 ^ ^Of 9 a or^ a or 

aucyaaycay 


^ ^or a a i^a a a a 

CCy aayaaaa 


^% 9 4* oi^ ot^Y^4*or 

caucucyCug 


1 on 

XO^U 




agcagcacca 


^ ^ ors ^ 4* ^ 

CCCgaCCuCu 


aacu^cgaca 


acgagccgga 


aaacac uuc u 


T Q n 
Xoo U 


a i~ /**^a 9 a a 

a.uCya.a.a.a.CC 


T.y ayCucuya 


cauCaucggc 


^aoro*^orora 9 ^ 

caycuyyaac 


4'ora ^or^^or a a 

ugaugccgaa 


ot a 4> Mor a a ot ^» t' 

caucgaacgu 


1 TA ft 
X /4U 


^ uCCCaaaCy 


guaaaaagua 


cgagcuggac 


aaauauacca 


4* or 4* 4~ a 

ugcuccacua 


ccugcgcgcg 


1 fiAft 

Xouu 


^ a f^ffa ;a 4~ +~ +~ 
Ca.yyaa.ui.uy 


aaCacyycaa 


4* ^ /^/^4" a 4"* 

auCCCyUauC 


a o* 4" or a o* 4* a 

y CaC uyaCua 


aCuCCy uuaa 


cgaagc uccg 


Xoo u 


Ct.CaaCCCyu 


cccgcguaua 




uc uagcgacu 


acgugaaaaa 


ggucaacaaa 


1 Q O A 


r*T^^a ^t*^a ar^ 

ycyad.ya.a.y 


ctgcaaugu^ 


c u tgy gc c gg 


4" ^ or 9 a ^ a or ^ 

gu ugaacagc 


4~ t- or 4- 4- 4* 9 ^ OP 9 

uuguuuauga 


uuuuaccgac 


T Q Q A 


yayacy uccy 


9 9 9 4* ^ 4* 9 ^ 

aagcauccac 


^ 9 /*1 /*«AY a OI9 9 9 

(.accyacaaa 


a ^ 4* or^orora 4" a 

aux^gcggaua 


^ Ota m4> a ^ M a ^ 

ucacuaccau 


caucccgcac 


O AA A 


etuu^yy u>^^^yy 


4- /-i4» r^ra a ^a +■ 

c ucuy aaCa u 


4- oror^a a /^9 4" 

uyy Ca.aCat>y 


o«4"or4~ a ^a a a 
C uy uaCaaay 


aCyaCuuCy u 


+• or or o» or /*t a ot 4~ ot 

uyyCy CdC uy 


OT Aft 


au>wi»u\oU^ ^y 


y ^y ^yy i>»yciL> 


u><^ uy w*uyy ciy 


4* 4- /-« Q 4- c*f^r*nn 
uui.*auu.uu>yy 


a a a 4~ ^ofoo*a 4* 
aaa1..cyccciu 


o» ^ <"TOf 4* 9 r^+*or 

cccyy udcuy 


Z XO U 


y y wau.u. u u uy 


v> uv.*uyy u i»l.u> 


^ a r»at*t*fTr'a 
L> UCi^Ct L. L>y (^O. 


a a <^ a a crcrt" ^ r» 
ciau>ciayy u uu 


uyac uy uaCa 


a a ^ ^ a +" otof a 

ddccd ucydC 




CICH-. y ^^y t. uy a 


9 9 9 e^n^ a a 

ycaaocy L.aa 


o*ora a a a 9 ^ of ^ 
Cy aaaaauyy 


yauy day u uu 


90t999i*94'94' 

acaaauauau 


otor^ OP a ^ a a ^ 

cgugaccaac 




'^13 23^ uyy V* ua 


arrni*" t* aa<" ar* 
ay y l. uaauat^ 


i^a#TPi +" ^na <^ 
u^ay ctuu-y av« 


^uv*auu>uyu>a 


a a a a a ai*rta a 
aaaaaauy aa 


a or a a Of ot a o« ^ or 

dy ddy L^du. uy 




Q&a.a.af*! p a crcr 


w y y a ciy So u a 


I.* Aayy waa uv>» 


a t* t* aar* fc ar* f 


a O't" a r*a a r»a 
ay uauaauug 


rtl~ a r'a r^OfTarr 
y uau.d^^y dy 


9400 


gaagaaaaaa 


acaacatcaa 


cttcaacatc 


gacgatctgt 


cctctaaact 


gaacgaatcc 


2460 


atcaacaaag 


ctatgatcaa 


catcaacaag 


ttcctgaacc 


agtgctctgt 


aagctatctg 


2520 


atgaactcca 


tgatcccgta 


cggtgttaaa 


cgtctggagg 


acttcgatgc 


gtctctgaaa 


2580 


gacgccctgc 


tgaaatacat 


ttacgacaac 


cgtggcactc 


tgatcggtca 


ggttgatcgt 


2640 
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ctgaaggaca aagtgaacaa taccttatcg accgacatcc cttttcagct cagtaaatat 2700 
gtcgataacc aacgcctttt gtccactcta gactag 2736 



<210> 42 
<211> 911 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 42 

Leu Gly lie Glu Gly Arg Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser 
15 10 15 



Ala Arg Lys Leu Ala Asn Gin Thr Ser Gly Gly Gly Gly Ser Gly Gly 
20 25 30 



Gly Gly Ser Gly Gly Gly Gly Ser Pro Arg Gly Ser Met Glu Phe Val 
35 40 45 



Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val Asn Gly Val Asp lie Ala 
50 55 60 



Tyr He Lys He Pro Asn Ala Gly Gin Met Gin Pro Val Lys Ala Phe 
65 70 75 80 



Lys He His Asn Lys He Trp Val He Pro Glu Arg Asp Thr Phe Thr 
85 90 95 



Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro Pro Glu Ala Lys Gin Val 
100 105 110 



Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu Ser Thr Asp Asn Glu Lys 
115 120 125 



Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu Phe Glu Arg He Tyr Ser 
130 135 140 



Thr Asp Leu Gly Arg Met Leu Leu Thr Ser He Val Arg Gly He Pro 
145 150 155 160 



Phe Trp Gly Gly Ser Thr He Asp Thr Glu Leu Lys Val He Asp Thr 
165 170 175 



.Asn Cys IJie Asn Val He ,Gln.Pro .Asp ,Giy Ser .Ty;r • Arjg Ser'Glu^Glu 
180 185 190 
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Leu Asn Leu Val He He Gly Pro Ser Ala Asp He He Gin Phe Glu 
195 200 205 



Cys Lys Ser Phe Gly His Glu Val Leu Asn Leu Thr Arg Asn Gly Tyr 
210 215 220 



Gly Ser Thr Gin Tyr He Arg Phe Ser Pro Asp Phe Thr Phe Gly Phe 
225 230 235 240 



Glu Glu Ser Leu Glu Val Asp Thr Asn Pro Leu Leu Gly Ala Gly Lys 
245 250 255 



Phe Ala Thr Asp Pro Ala Val Thr Leu Ala His Glu Leu He His Ala 
260 265 270 



Gly His Arg Leu Tyr Gly He Ala He Asn Pro Asn Arg Val Phe Lys 
275 280 285 



Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser Gly Leu Glu Val Ser Phe 
290 295 300 



Glu Glu Leu Arg Thr Phe Gly Gly His Asp Ala Lys Phe He Asp Ser 
305 310 315 320 



Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr Tyr Asn Lys Phe Lys Asp 
325 330 335 



He Ala Ser Thr Leu Asn Lys Ala Lys Ser He Val Gly Thr Thr Ala 
340 345 350 



Ser Leu Gin Tyr Met Lys Asn Val Phe Lys Glu Lys Tyr Leu Leu Ser 
355 360 365 



Glu Asp Thr Ser Gly Lys Phe Ser Val Asp Lys Leu Lys Phe Asp Lys 
370 375 380 



Leu Tyr Lys Met Leu Thr Glu He Tyr Thr Glu Asp Asn Phe Val Lys 
385 390 395 400 



Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr Leu Asn Phe Asp Lys Ala 
405 410 415 



Val Phe Lys He Asn He Val Pro Lys Val Asn Tyr Thr He Tyr Asp 
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Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala Ala Asn Phe Asn Gly Gin 
435 440 445 



Asn Thr Glu lie Asn Asn Met Asn Phe Thr Lys Leu Lys Asn Phe Thr 
450 455 460 



Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys Val Asp Gly lie lie Thr 
465 470 475 480 



Ser Lys Thr Lys Ser Leu He Glu Gly Arg Asn Lys Ala Leu Asn Asp 
485 490 495 



Leu Cys He Lys Val Asn Asn Trp Asp Leu Phe Phe Ser Pro Ser Glu 
500 505 510 



Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu He Thr Ser Asp 
515 520 525 



Thr Asn He Glu Ala Ala Glu Glu Asn He Ser Leu Asp Leu He Gin 
530 535 540 



Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro Glu Asn He Ser 
545 550 555 560 



He Glu Asn Leu Ser Ser Asp He He Gly Gin Leu Glu Leu Met Pro 
565 570 575 



Asn He Glu Arg Phe Pro Asn. Gly Lys Lys Tyr Glu Leu Asp Lys Tyr 
580 585 590 



Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu His Gly Lys Ser 
595 600 . . 605 



Arg He Ala Leu Thr Asn Ser Val Asn Glu Ala Leu Leu Asn Pro Ser 
610 615 620 



Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys Lys Val Asn Lys 
625 630 635 640 



Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu Gin Leu Val Tyr 
645 650 655 



Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr Asp Lys He Ala 
660 665 670 
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Asp He Thr He He He Pro Tyr He Gly Pro Ala Leu Asn He Gly 
675 680 685 



Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu He Phe Ser Gly 
690 695 700 



Ala Val He Leu Leu Glu Phe He Pro Glu He Ala He Pro Val Leu 
705 710 715 720 



Gly Thr Phe Ala Leu Val Ser Tyr He Ala Asn Lys Val Leu Thr Val 
725 730 735 



Gin Thr He Asp Asn Ala Leu Ser Lys Arg Asn Glu Lys Trp Asp Glu 
740 745 750 



Val Tyr Lys Tyr He Val Thr Asn Trp Leu Ala Lys Val Asn Thr Gin 
755 760 765 



He Asp Leu He Arg Lys Lys Met Lys Glu Ala Leu Glu Asn Gin Ala 
770 775 780 



Glu Ala Thr Lys Ala lie He Asn Tyr Gin Tyr Asn Gin Tyr Thr Glu 
785 790 795 800 



Glu Glu Lys Asn Asn He Asn Phe Asn He Asp Asp Leu Ser Ser Lys 
805 810 815 



Leu Asn Glu Ser He Asn Lys Ala Met He Asn He Asn Lys Phe Leu 
820 ' 825 830 



Asn Gin Cys Ser Val Ser Tyr Leu Met Asn Ser Met He Pro Tyr Gly 
835 840 845 



Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys Asp Ala Leu Leu 
850 855 860 



Lys Tyr He Tyr Asp Asn Arg Gly Thr Leu He Gly Gin Val Asp Arg 
865 870 875 880 



Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp He Pro Phe Gin 
885 890 895 



Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu Leu Ser Thr Leu Asp 
900 905 910 



• ' ;■ - i ■. 
<210> 43 
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<211> 2715 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 43 



ggatccgaat 


tcatgccgat 


caccatcaac 


aacttcaact acagcgatcc 


ggtggataac 


60 


aaaaacatcc 


tgtacctgga 


tacccatctg 


aataccctgg cgaacgaacc 


ggaaaaagcg 


120 


tttcgtatca 


ccggcaacat 


ttgggttatt 


ccggatcgtt ttagccgtaa 


cagcaacccg 


180 


aatctgaata 


aaccgccgcg 


tgttaccagc 


ccgaaaagcg gttattacga 


tccgaactat 


240 


ctgagcaccg 


atagcgataa 


agataccttc 


ctgaaagaaa tcatcaaact 


gttcaaacgc 


300 


atcaacagcc 


gtgaaattgg 


cgaagaactg 


atctatcgcc tgagcaccga 


tattccgttt 


360 


ccgggcaaca 


acaacacccc 


gatcaacacc 


tttgatttcg atgtggattt 


caacagcgtt 


420 


gatgttaaaa 


cccgccaggg 


taacaattgg 


gtgaaaaccg gcagcattaa 


cccgagcgtg 


480 


attattaccg 


gtccgcgcga 


aaacattatt 


gatccggaaa ccagcacctt 


taaactgacc 


540 


aacaacacct 


ttgcggcgca 


ggaaggtttt 


ggcgcgctga gcattattag 


cattagcccg 


600 


cgctttatgc 


tgacctatag 


caacgcgacc 


aacgatgttg gtgaaggccg 


tttcagcaaa 


660 


agcgaatttt 


gcatggaccc 


gatcctgatc 


ctgatgcatg aactgaacca 


tgcgatgcat 


720 


aacctgtatg 


gcatcgcgat 


tccgaacgat 


cagaccatta gcagcgtgac 


cagcaacatc 


780 


ttttacagcc 


agtacaacgt 


gaaactggaa 


tatgcggaaa tctatgcgtt 


tggcggtccg 


840 


accattgatc 


tgattccgaa 


aagcgcgcgc 


aaatacttcg aagaaaaagc 


gctggattac 


900 


tatcgcagca 


ttgcgaaacg 


tctgaacagc 


attaccaccg cgaatccgag 


cagcttcaac 


960 


aaatatatcg 


gcgaatataa 


acagaaactg 


atccgcaaat atcgctttgt 


ggtggaaagc 


1020 


agcggcgaag 


ttaccgttaa 


ccgcaataaa 


ttcgtggaac tgtacaacga 


actgacccag 


1080 


atcttcaccg 


aatttaacta 


tgcgaaaatc 


tataacgtgc agaaccgtaa 


aatctacctg 


1140 


agcaacgtgt 


ataccccggt 


gaccgcgaat 


attctggatg ataacgtgta 


cgatatccag 


1200 


aacggcttta 


acatcccgaa 


aagcaacctg 


aacgttctgt ttatgggcca 


gaacctgagc 


1260 


cgtaatccgg 


cgctgcgtaa 


agtgaacccg 


gaaaacatgc tgtacctgtt 


caccaaattt 


1320 


tgcgtcgacg 


cgatagatgg 


tagatttggc 


ggtttcacgg gcgcacgcaa 


atcagcgcgt 


1380 


aaattagcta 


accaggcgct 


agcgggcggt 


ggcggtagcg gcggtggcgg 


tagcggcggt 


1440 


ggcggtagcg 


cactagtgct 


gcagtgtcgt 


gaactgctgg tgaaaaacac 


cgatctgccg 


1500 


tttattggcg 


atatcagcga 


tgtgaaaacc 


gatatcttcc tgcgcaaaga 


tatcaacgaa 


1560 


gaaaccgaag 


tgatctacta 


cccggataac 


gtgagcgttg atcaggtgat 


cctgagcaaa 


1620 
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aacaccagcg aacatggtca gctggatctg ctgtatccga gcattgatag cgaaagcgaa 1680 

attctgccgg gcgaaaacca ggtgttttac gataaccgta cccagaacgt ggattacctg 1740 

aacagctatt actacctgga aagccagaaa ctgagcgata acgtggaaga ttttaccttt 1800 

acccgcagca ttgaagaagc gctggataac agcgcgaaag tttacaccta ttttccgacc 1860 

ctggcgaaca aagttaatgc gggtgttcag ggcggtctgt ttctgatgtg ggcgaacgat 1920 

gtggtggaag atttcaccac caacatcctg cgtaaagata ccctggataa aatcagcgat 1980 

gttagcgcga ttattccgta tattggtccg gcgctgaaca ttagcaatag cgtgcgtcgt 2040 

ggcaatttta ccgaagcgtt tgcggttacc ggtgtgacca ttctgctgga agcgtttccg 2100 

gaatttacca ttccggcgct gggtgcgttt gtgatctata gcaaagtgca ggaacgcaac 2160 

gaaatcatca aaaccatcga taactgcctg gaacagcgta ttaaacgctg gaaagatagc 2220 

tatgaatgga tgatgggcac ctggctgagc cgtattatca cccagttcaa caacatcagc 2280 

taccagatgt acgatagcct gaactatcag gcgggtgcga ttaaagcgaa aatcgatctg 2340 

gaatacaaaa aatacagcgg cagcgataaa gaaaacatca aaagccaggt tgaaaacctg 2400 

aaaaacagcc tggatgtgaa aattagcgaa gcgatgaata acatcaacaa attcatccgc 2460 

gaatgcagcg tgacctacct gttcaaaaac atgctgccga aagtgatcga tgaactgaac 2520 

gaatttgatc gcaacaccaa agcgaaactg atcaacctga tcgatagcca caacattatt 2580 

ctggtgggcg aagtggataa actgaaagcg aaagttaaca acagcttcca gaacaccatc 2640 

ccgtttaaca tcttcagcta taccaacaac agcctgctga aagatatcat caacgaatac 2700 

ttcaatctag actag 2715 



<210> 44 

<211> 904 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 44 

Gly Ser Glu Phe Met Pro lie Thr lie Asn Asn Phe Asn Tyr Ser Asp 
1 5 * 10 15 



Pro Val Asp Asn Lys Asn lie Leu Tyr Leu Asp Thr His Leu Asn Thr 
20 25 30 



Leu Ala Asn Glu Pro Glu Lys Ala Phe Arg lie Thr Gly Asn lie Trp 

35 - 40 . 45 



wo 2006/059105 



58 



PCT/GB2005/004598 



Val He Pro Asp Arg Phe Ser Arg Asn Ser Asn Pro Asn Leu Asn Lys 
50 55 60 



Pro Pro Arg Val Thr Ser Pro Lys Ser Gly Tyr Tyr Asp Pro Asn Tyr 
65 70 75 80 



Leu. Ser .Thr Asp Ser Asp Lys Asp Thr Phe Leu Lys Glu He He Lys 
85 90 95 



Leu Phe Lys Arg He Asn Ser Arg Glu He Gly Glu Glu Leu He Tyr 
100 105 110 



Arg Leu Ser Thr Asp He Pro Phe Pro Gly Asn Asn Asn Thr Pro He 
115 120 125 



Asn Thr Phe Asp Phe Asp Val Asp Phe Asn Ser Val Asp Val Lys Thr 
130 135 140 



Arg Gin Gly Asn Asn Tzrp Val Lys Thr Gly Ser He Asn Pro Ser Val 
145 150 155 160 



He He Thr Gly Pro Arg Glu Asn He He Asp Pro Glu Thr Ser Thr 
165 170 175 



Phe Lys Leu Thr Asn Asn Thr Phe Ala Ala Gin Glu Gly Phe Gly Ala 
180 185 190 



Leu Ser He He Ser He Ser Pro Arg Phe Met Leu Thr Tyr Ser Asn 
195 200 205 



Ala Thr Asn Asp Val Gly Glu Gly Arg Phe Ser Lys Ser Glu Phe Cys 
210 215 220 



Met Asp Pro He Leu He Leu Met His Glu Leu Asn His Ala Met His 
225 230 235 240 



Asn Leu Tyr Gly He Ala He Pro Asn Asp Gin Thr He Ser Ser Val 
245 250 255 



Thr Ser Asn He Phe Tyr Ser Gin Tyr Asn Val Lys Leu Glu Tyr Ala 
260 265 270 



Glu He Tyr Ala Phe Gly Gly Pro Thr He Asp Leu He Pro Lys Ser 
275 280 285 
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Ala Arg Lys Tyr Phe Glu Glu Lys Ala Leu Asp Tyr Tyr Arg Ser lie 
290 295 300 



Ala Lys Arg Leu Asn Ser lie Thr Thr Ala Asn Pro Ser Ser Phe Asn 
305 310 315 320 



Lys Tyr lie Gly Glu Tyr Lys Gin Lys Leu lie Arg Lys Tyr Arg Phe 
325 330 335 



Val Val Glu Ser Ser Gly Glu Val Thr Val Asn Arg Asn Lys Phe Val 
340 345 350 



Glu Leu Tyr Asn Glu Leu Thr Gin lie Phe Thr Glu Phe Asn Tyr Ala 
355 360 365 



Lys lie Tyr Asn Val Gin Asn Arg Lys lie Tyr Leu Ser Asn Val Tyr 
370 375 380 



Thr Pro Val Thr Ala Asn lie Leu Asp Asp Asn Val Tyr Asp lie Gin 
385 390 395 400 



Asn Gly Phe Asn lie Pro Lys Ser Asn Leu Asn Val Leu Phe Met Gly 
405 410 415 



Gin Asn Leu Ser Arg Asn Pro Ala Leu Arg Lys Val Asn Pro Glu Asn 
420 425 430 



Met Leu Tyr Leu Phe Thr Lys Phe Cys Val Asp Ala lie Asp Gly Arg 
435 440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Leu Ala Asn 
450 455 460 



Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
465 470 475 480 



Gly Gly Ser Ala Leu Val Leu Gin Cys Arg Glu Leu Leu Val Lys Asn 
485 490 495 



Thr Asp Leu Pro Phe lie Gly Asp lie Ser Asp Val Lys Thr Asp lie 
500 505 510 



Phe Leu Arg Lys Asp He Asn Glu Glu Thr Glu Val He Tyr Tyr Pro 
515 520 525 



Asp Asn Val Ser Val Asp Gin Val He Leu Ser Lys Asn Thr Ser Glu 
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530 535 540 



His Gly Gin heu Asp Leu Leu Tyr Pro Ser He Asp Ser Glu Ser Glu 
545 550 555 560 



He Leu Pro Gly Glu Asn Gin Val Phe Tyr Asp Asn Arg Thr Gin Asn 
565 570 575 



Val Asp Tyr Leu Asn Ser Tyr Tyr Tyr Leu Glu Ser Gin Lys Leu Ser 
580 585 590 



Asp Asn Val Glu Asp Phe Thr Phe Thr Arg Ser He Glu Glu Ala Leu 
595 . 600 605 



Asp Asn Ser Ala Lys Val Tyr Thr Tyr Phe Pro Thr Leu Ala Asn Lys 
610 615 620 



Val Asn Ala Gly Val Gin Gly Gly Leu Phe Leu Met Trp Ala Asn Asp 
625 630 635 640 



Val Val Glu Asp Phe Thr Thr Asn He Leu Arg Lys Asp Thr Leu Asp 
645 650 655 



Lys He Ser Asp Val Ser Ala He He Pro Tyr He Gly Pro Ala Leu 
660 665 670 



Asn He Ser Asn Ser Val Arg Arg Gly Asn Phe Thr Glu Ala Phe Ala 
675 680 685 



Val Thr Gly Val Thr He Leu Leu Glu Ala Phe Pro Glu Phe Thr He 
690 695 700 



Pro Ala Leu Gly Ala Phe Val He Tyr Ser Lys Val Gin Glu Arg Asn 
705 710 715 720 



Glu He He Lys Thr He Asp Asn Cys Leu Glu Gin Arg He Lys Arg 
725 730 735 



Trp Lys Asp Ser Tyr Glu Trp Met Met Gly Thr Trp Leu Ser Arg He 
740 745 750 



He Thr Gin Phe Asn Asn He Ser Tyr Gin Met Tyr Asp Ser Leu Asn 
755 760 765 



Tyr Gin Ala Gly Ala He Lys Ala Lys He Asp Leu Glu Tyr Lys Lys 
770 775 780 
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Tyr Ser Gly Ser Asp Lys Glu Asn He Lys Ser Gin Val Glu Asn Leu 
785 790 795 800 



Lys Asn Ser Leu Asp Val Lys He Ser Glu Ala Met Asn Asn He Asn 
805 810 815 



Lys Phe He Arg Glu Cys Ser Val Thr Tyr Leu Phe Lys Asn Met Leu 
820 825 830 



Pro Lys Val He Asp Glu Leu Asn Glu Phe Asp Arg Asn Thr Lys Ala 
835 840 845 



Lys Leu He Asn Leu He Asp Ser His Asn He lie Leu Val Gly Glu 
850 855 860 



Val Asp Lys Leu Lys Ala Lys Val Asn Asn Ser Phe Gin Asn Thr He 
865 870 875 880 



Pro Phe Asn He Phe Ser Tyr Thr Asn Asn Ser Leu Leu Lys Asp He 
885 890 895 



He Asn Glu Tyr Phe Asn Leu Asp 
900 



<210> 45 

<211> 2742 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Syathetic 

<400> 45 



ggatccgaat tcatgccgat 


caccatcaac 


aacttcaact 


acagcgatcc 


ggtggataac 


60 


aaaaacatcc tgtacctgga 


tacccatctg 


aataccctgg 


cgaacgaacc 


ggaaaaagcg 


120 


tttcgtatca ccggcaacat 


ttgggttatt 


ccggatcgtt 


ttagccgtaa 


cagcaacccg 


180 


aatctgaata aaccgccgcg 


tgttaccagc 


ccgaaaagcg 


gttattacga 


tccgaactat 


240 


ctgagcaccg atagcgataa 


agataccttc 


ctgaaagaaa 


tcatcaaact 


gttcaaacgc 


300 


atcaacagcc gtgaaattgg 


cgaagaactg 


atctatcgcc 


tgagcaccga 


tattccgttt 


360 


ccgggcaaca acaacacccc 


gatcaacacc 


tttgatttcg 


atgtggattt 


caacagcgtt 


420 


gatgttaaaa cccgccaggg 


taacaattgg 


gtgaaaaccg 


gcagcattaa 


cccgagcgtg 


480 


attattaccg gtccgcgcga 


aaacattatt 


gatccggaaa 


..ccagcacctt 


taaactgacc 


540 
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aacaacacct 


ttgcggcgca 


ggaaggtttt 


ggcgcgctga 


gcattattag 


cattagcccg 


600 


cgctttatgc 


tgacctatag 


caacgcgacc 


aacgatgttg 


gtgaaggccg 


tttcagcaaa 


660 


agcgaatttt 


gcatggaccc 


gatcctgatc 


ctgatgcatg 


aactgaacca 


tgcgatgcat 


720 


aacctgtatg 


gcatcgcgat 


tccgaacgat 


cagaccatta 


gcagcgtgac 


cagcaacatc 


780 


ttttacagcc 


agtacaacgt 


gaaactggaa 


tatgcggaaa 


tctatgcgtt 


tggcggtccg 


840 


accattgatc 


tgattccgaa 


aagcgcgcgc 


aaatacttcg 


aagaaaaagc 


gctggattac 


900 


tatcgcagca 


ttgcgaaacg 


tctgaacagc 


attaccaccg 


cgaatccgag 


cagcttcaac 


960 


aaatatatcg 


gcgaatataa 


acagaaactg 


atccgcaaat 


atcgctttgt 


ggtggaaagc 


1020 


agcggcgaag 


ttaccgttaa 


ccgcaataaa 


ttcgtggaac 


tgtacaacga 


actgacccag 


1080 


atcttcaccg 


aatttaacta 


tgcgaaaatc 


tataacgtgc 


agaaccgtaa 


aatctacctg 


1140 


agcaacgtgt 


ataccccggt 


gaccgcgaat 


attctggatg 


ataacgtgta 


cgatatccag 


1200 


aacggcttta 


acatcccgaa 


aagcaacctg 


aacgttctgt 


ttatgggcca 


gaacctgagc 


1260 


cgtaatccgg 


cgctgcgtaa 


agtgaacccg 


gaaaacatgc 


tgtacctgtt 


caccaaattt 


1320 


tgcgtcgacg 


gcatcattac 


ctccaaaact 


aaatctctga 


tagaaggtag 


atttggcggt 


1380 


ttcacgggcg 


cacgcaaatc 


agcgcgtaaa 


ttagctaacc 


aggcgctagc 


gggcggtggc 


1440 


ggtagcggcg 


gtggcggtag 


cggcggtggc 


ggtagcgcac 


tagtgctgca 


gtgtcgtgaa 


1500 


ctgctggtga 


aaaacaccga 


tctgccgttt 


attggcgata 


tcagcgatgt 


gaaaaccgat 


1560 


atcttcctgc 


gcaaagatat 


caacgaagaa 


accgaagtga 


tctactaccc 


ggataacgtg 


1620 


agcgttgatc 


aggtgatcct 


gagcaaaaac 


accagcgaac 


atggtcagct 


ggatctgctg 


1680 


tatccgagca 


ttgatagcga 


aagcgaaatt 


ctgccgggcg 


aaaaccaggt 


gttttacgat 


1740 


aaccgtaccc 


agaacgtgga 


ttacctgaac 


agctattact 


acctggaaag 


ccagaaactg 


1800 


agcgataacg 


tggaagattt 


tacctttacc 


cgcagcattg 


aagaagcgct 


ggataacagc 


1860 


gcgaaagttt 


acacctattt 


tccgaccctg 


gcgaacaaag 


ttaatgcggg 


tgttcagggc 


1920 


ggtctgtttc 


tgatgtgggc 


gaacgatgtg 


gtggaagatt 


tcaccaccaa 


catcctgcgt 


1980 


aaagataccc 


tggataaaat 


cagcgatgtt 


agcgcgatta 


ttccgtatat 


tggtccggcg 


2040 


ctgaacatta 


gcaatagcgt 


gcgtcgtggc 


aattttaccg 


aagcgtttgc 


ggttaccggt 


2100 


gtgaccattc 


tgctggaagc 


gtttccggaa 


tttaccattc 


cggcgctggg 


tgcgtttgtg 


2160 


atctatagca 


aagtgcagga 


acgcaacgaa 


atcatcaaaa 


ccatcgataa 


ctgcctggaa 


2220 


cagcgtatta 


aacgctggaa 


agatagctat 


gaatggatga 


tgggcacctg 


gctgagccgt 


2280 


attatcaccc 


agttcaacaa 


catcagctac 


cagatgtacg 


atagcctgaa 


ctatcaggcg 


2340 


ggtgcgatta 


aagcgaaaat 


cgatctggaa 


tacaaaaaat 


acagcggcag 


cgataaagaa 


2400 
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del cL u v« cLctda 


gccd^^c^ga 


dddCC U^ddd 


a a ^ a r^r* r* ^ t-^r^ a ^<^+*^aaaat* f* a nc^ci^ a c^f*t-t 
da%M>dMC-w dl^^UMCidaau udvjv^^dd^^^ 


^ 'Z V w 


A ^ /~T2a ra -a <"« 

d u 3 ad L. da w a 


UCdaCdddT. U 


Cduccgc^dd 


u^Cd^cgugd udCCU^uu w«dddddwdu^ 




ctgccgaaag 


tgatcgatga 


actgaacgaa 


tttgatcgca acaccaaagc gaaactgatc 


2580 


aacctgatcg 


atagccacaa 


cattattctg 


gtgggcgaag tggataaact gaaagcgaaa 


2640 


gttaacaaca 


gcttccagaa 


caccatcccg 


tttaacatct tcagctatac caacaacagc 


2700 


ctgctgaaag 


atatcatcaa 


cgaatacttc 


aatctagact ag 


2742 



<210> 


46 


<211> 


.913 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


46 



Gly Ser Glu Phe Met Pro lie Thr He Asn Asn Phe Asn Tyr Ser Asp 
15 10 15 



Pro Val Asp Asn Lys Asn He Leu Tyr Leu Asp Thr His Leu Asn Thr 
20 25 30 



Leu Ala Asn Glu Pro Glu Lys Ala Phe Arg He Thr Gly Asn He Trp 
35 40 45 



Val He Pro Asp Arg Phe Ser Arg Asn Ser Asn Pro Asn Leu Asn Lys 

50 55 ea 



Pro Pro Arg Val Thr Ser Pro Lys Ser Gly Tyr Tyr Asp Pro Asn Tyr 
65 70 75 80 



Leu Ser Thr Asp Ser Asp Lys Asp Thr Phe Leu Lys Glu He He Lys 
85 90 95 



Leu. Phe Lys Arg He Asn Ser Arg Glu He Gly Glu Glu Leu He Tyr 
100 105 110 



Arg Leu Ser Thr Asp He Pro Phe Pro Gly Asn Asn Asn Thr Pro He 
115 120 125 



Asn Thr Phe Asp Phe Asp Val Asp Phe Asn Ser Val Asp Val Lys Thr 
130 135 140 



Arg Gin Gly Asn Asn Trp Val Lys Thr Gly Ser He Asn Pro Ser Val 



wo 2006/059105 



64 



PCT/GB2005/004598 



145 150 155 160 



lie lie Thr Gly Pro Arg Glu Asn lie He Asp Pro Glu Thr Ser Thr 
165 170 175 



Phe Lys Leu Thr Asn Asn Thr Phe Ala Ala Gin Glu Gly Phe Gly Ala 
180 185 190 



Leu Ser He He Ser He Ser Pro Arg Phe Met Leu Thr Tyr Ser Asn 
195 200 205 



Ala Thr Asn Asp Val Gly Glu Gly Arg Phe Ser Lys Ser Glu Phe Cys 
210 215 220 



Met Asp Pro He Leu He Leu Met His Glu Leu Asn His Ala Met His 
225 230 235 240 



Asn Leu Tyr Gly He Ala He Pro Asn Asp Gin Thr He Ser Ser Val 
245 250 255 



Thr Ser Asn He Phe Tyr Ser Gin Tyr Asn Val Lys Leu Glu Tyr Ala 
260 265 270 



Glu He Tyr Ala Phe Gly Gly Pro Thr He Asp Leu He Pro Lys Ser 
275 280 285 



Ala Arg Lys Tyr Phe Glu Glu Lys Tlla Leu Asp Tyr Tyr Arg Ser He 
290 295 300 



Ala Lys Arg Leu Asn Ser He Thr Thr Ala Asn Pro Ser Ser Phe Asn 
305 310 315 320 



Lys Tyr He Gly Glu Tyr Lys Gin Lys Leu He Arg Lys Tyr Arg Phe 
325 330 335 



Val Val Glu Ser Ser Gly Glu Val Thr Val Asn Arg Asn Lys Phe Val 
340 345 350 



Glu Leu Tyr Asn Glu Leu Thr Gin He Phe Thr Glu Phe Asn Tyr Ala 
355 360 365 



Lys He Tyr Asn Val Gin Asn Arg Lys He Tyr Leu Ser Asn Val Tyr 
370 375 380 



Thr Pro Val Thr Ala Asn He Leu Asp Asp Asn Val Tyr Asp He Gin 
385 390 " 395 400 
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Asn Gly Phe Asn lie Pro Lys Ser Asn Leu Asn Val Leu Phe Met Gly 
405 410 415 



Gin Asn Leu Ser Arg Asn Pro Ala Leu Arg Lys Val Asn Pro Glu Asn 
420 425 430 



Met Leu Tyr Leu Phe Thr Lys Phe Cys Val Asp Gly lie lie Thr Ser 
435 440 445 



Lys Thr Lys Ser Leu lie Glu Gly Arg Phe Gly Gly Phe Thr Gly Ala 
450 455 460 



Arg Lys Ser Ala Arg Lys Leu Ala Asn Gin Ala Leu Ala Gly Gly Gly 
465 470 475 480 



Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu Val Leu 
485 490 495 



Gin Cys Arg Glu Leu Leu Val Lys Asn Thr Asp Leu Pro Phe lie Gly 
500 505 510 



Asp lie Ser Asp Val Lys Thr Asp lie Phe Leu Arg Lys Asp lie Asn 
515 520 525 



Glu Glu Thr Glu Val lie Tyr Tyr Pro Asp Asn Val Ser Val Asp Gin 
530 535 540 



Val lie Leu Ser Lys Asn Thr Ser Glu His Gly Gin Leu Asp Leu Leu 
545 550 555 560 



Tyr Pro Ser lie Asp Ser Glu Ser Glu He Leu Pro Gly Glu Asn Gin 
565 570 575 



Val Phe Tyr Asp Asn Arg Thr Gin Asn Val Asp Tyr Leu Asn Ser Tyr 
580 585 590 



Tyr Tyr Leu Glu Ser Gin Lys Leu Ser Asp Asn Val Glu Asp Phe Thr 
595 600 605 



Phe Thr Arg Ser He Glu Glu Ala Leu Asp Asn Ser Ala Lys Val Tyr 
610 615 620 



Thr Tyr Phe Pro Thr Leu Ala Asn Lys Val Asn Ala Gly Val Gin Gly 

62.5..,,,. . . . ; ,.630 . . .633 . . ' • 640 
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Gly Leu Phe Leu Met Txp Ala Asn Asp Val Val Glu Asp Phe Thr Thr 
645 650 655 



Asn lie Leu Arg Lys Asp Thr Leu Asp Lys lie Ser Asp Val Ser Ala 
660 665 670 



lie lie Pro Tyr lie Gly Pro Ala Leu Asn He Ser Asn Ser Val Arg 
675 680 685 



Arg Gly Asn Phe Thr Glu Ala Phe Ala Val Thr Gly Val Thr He Leu 
690 695 700 



Leu Glu Ala Phe Pro Glu Phe Thr He Pro Ala Leu Gly Ala Phe Val 
705 710 715 720 



He Tyr Ser Lys Val Gin Glu Arg Asn Glu He He Lys Thr He Asp 
725 730 735 



Asn Cys Leu Glu Gin Arg He Lys Arg Trp Lys Asp Ser Tyr Glu Trp 
740 745 750 



Met Met Gly Thr Trp Leu Ser Arg He He Thr Gin Phe Asn Asn He 
755 760 765 



Ser Tyr Gin Met Tyr Asp Ser Leu Asn Tyr Gin Ala Gly Ala He Lys 
770 775 780 



Ala Lys He Asp Leu Glu Tyr Lys Lys Tyr Ser Gly Ser Asp Lys Glu 
785 790 795 800 



Asn He Lys Ser Gin Val Glu Asn Leu Lys Asn Ser Leu Asp Val Lys 
805 810 815 



He Ser Glu Ala Met Asn Asn He Asn Lys Phe He Arg Glu Cys Ser 
820 825 830 



Val Thr Tyr Leu Phe Lys Asn Met Leu Pro Lys Val He Asp Glu Leu 
835 840 845 



Asn Glu Phe Asp Arg Asn Thr Lys Ala Lys Leu He Asn Leu He Asp 
850 855 860 



Ser His Asn He He Leu Val Gly Glu Val Asp Lys Leu Lys Ala Lys 
865 870 875 880 
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Val Asn Asn Ser Phe Gin Asn Thr lie Pro Phe Asn He Phe Ser Tyr 
885 890 895 

Thr Asn Asn Ser Leu Leu Lys Asp He He Asn Glu Tyr Phe Asn Leu 
900 905 910 

Asp 

<210> 47 

<211> 2673 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 47 



ggatccatgg 


agttcgttaa 


caaacagttc 


aactataaag 


acccagttaa 


cggtgttgac 


60 


attgcttaca 


tcaaaatccc 


gaacgctggc 


cagatgcagc 


cggtaaaggc 


attcaaaatc 


120 


cacaacaaaa 


tctgggttat 


cccggaacgt 


gataccttta 


ctaacccgga 


agaaggtgac 


180 


ctgaacccgc 


caccggaagc 


gaaacaggtg 


ccggtatctt 


actatgactc 


cacctacctg 


240 


tctaccgata 


acgaaaagga 


caactacctg 


aaaggtgtta 


ctaaactgtt 


cgagcgtatt 


300 


tactccaccg 


acctgggccg 


tatgctgctg 


actagcatcg 


ttcgcggtat 


cccgttctgg 


360 


ggcggttcta 


ccatcgatac 


cgaactgaaa 


gtaatcgaca 


ctaactgcat 


caacgttatt 


420 


cagccggacg 


gttcctatcg 


ttccgaagaa 


ctgaacctgg 


tgatcatcgg 


cccgtctgct 


480 


gatatcatcc 


agttcgagtg 


taagagcttt 


ggtcacgaag 


ttctgaacct 


cacccgtaac 


540 


ggctacggtt 


ccactcagta 


catccgtttc 


tctccggact 


tcaccttcgg 


ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


cccactgctg 


ggcgctggta 


aattcgcaac 


tgatcctgcg 


660 


gttaccctgg 


ctcacgaact 


gattcatgca 


ggccaccgcc 


tgtacggtat 


cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt 


taacaccaac 


gcgtattacg 


agatgtccgg 


tctggaagtt 


780 


agcttcgaag 


aactgcgtac 


ttttggcggt 


cacgacgcta 


aattcatcga 


ctctctgcaa 


840 


gaaaacgagt 


tccgtctgta 


ctactataac 


aagttcaaag 


atatcgcatc 


caccctgaac 


900 


aaagcgaaat 


ccatcgtggg 


taccactgct 


tctctccagt 


acatgaagaa 


cgtttttaaa 


960 


gaaaaatacc 


tgctcagcga 


agacacctcc 


ggcaaattct 


ctgtagacaa 


gttgaaattc 


1020 


gataaacttt 


acaaaatgct 


gactgaaatt 


tacaccgaag 


acaacttcgt 


taagttcttt 


1080 


aaagttctga 


accgcaaaac 


ctatctgaac 


ttcgacaagg 


cagtattcaa 


aatcaacatc 


1140 


.gtgccgaaag 


ttaactacac 


tatctacgat 


ggtttcaacc 


tgcgtaacac 


caacctggct 


1200 
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gctaatttta 


acggccagaa 


cacggaaatc 


aacaacatga 


acttcacaaa 


actgaaaaac 


1260 


ttcactggtc 


tgttcgagtt 


ttacaagctg 


ctgtgcgtcg 


acggcatcat 


tacctccaaa 


1320 


actaaatctc 


tgatagaagg 


tagatacggt 


ggtttcatgg 


cgctagcggg 


cggtggcggt 


1380 


agcggcggtg 


gcggtagcgg 


cggtggcggt 


agcgcactag 


tgctgcagtg 


tatcaaggtt 


1440 


aacaactggg 


atttattctt 


cagcccgagt 


gaagacaact 


tcaccaacga 


cctgaacaaa 


1500 


ggtgaagaaa 


tcacctcaga 


tactaacatc 


gaagcagccg 


aagaaaacat 


ctcgctggac 


1560 


ctgatccagc 


agtactacct 


gacctttaat 


ttcgacaacg 


agccggaaaa 


catttctatc 


1620 


gaaaacctga 


gctctgatat 


catcggccag 


ctggaactga 


tgccgaacat 


cgaacgtttc 


1680 


ccaaacggta 


aaaagtacga 


gctggacaaa 


tataccatgt 


tccactacct 


gcgcgcgcag 


1740 


gaatttgaac 


acggcaaatc 


ccgtatcgca 


ctgactaact 


ccgttaacga 


agctctgctc 


1800 


aacccgtccc 


gtgtatacac 


cttcttctct 


agcgactacg 


tgaaaaaggt 


caacaaagcg 


1860 


actgaagctg 


caatgttctt 


gggttgggtt 


gaacagcttg 


tttatgattt 


taccgacgag 


1920 


acgtccgaag 


tatctactac 


cgacaaaatt 


gcggatatca 


ctatcatcat 


cccgtacatc 


1980 


ggtccggctc 


tgaacattgg 


caacatgctg 


tacaaagacg 


acttcgttgg 


cgcactgatc 


2040 


ttctccggtg 


cggtgatcct 


gctggagttc 


atcccggaaa 


tcgccatccc 


ggtactgggc 


2100 


acctttgctc 


tggtttctta 


cattgcaaac 


aaggttctga 


ctgtacaaac 


catcgacaac 


2160 


gcgctgagca 


aacgtaacga 


aaaatgggat 


gaagtttaca 


aatatatcgt 


gaccaactgg 


2220 


ctggctaagg 


ttaatactca 


gatcgacctc 


atccgcaaaa 


aaatgaaaga 


agcactggaa 


2280 


aaccaggcgg 


aagctaccaa 


ggcaatcatt 


aactaccagt 


acaaccagta 


caccgaggaa 


2340 


gaaaaaaaca 


acatcaactt 


caacatcgac 


gatctgtcct 


ctaaactgaa 


cgaatccatc 


2400 


aacaaagcta 


tgatcaacat 


caacaagttc 


ctgaaccagt 


gctctgtaag 


ctatctgatg 


2460 


aactccatga 


tcccgtacgg 


tgttaaacgt 


ctggaggact 


tcgatgcgtc 


tctgaaagac 


2520 


gccctgctga 


aatacattta 


cgacaaccgt 


ggcactctga 


tcggtcaggt 


tgatcgtctg 


2580 


aaggacaaag 


tgaacaatac 


cttatcgacc 


gacatccctt 


ttcagctcag 


taaatatgtc 


2640 


gataaccaac 


gccttttgtc 


cactctagac 


tag 






2673 



<210> 48 

<211> 890 

<212> PRT 

<213i> Artificial Sequence 



<220> 

<223> Synthetic 



wo 2006/059105 



69 



PCT/GB2005/004598 



Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
15 10 15 



Asn Gly Val Asp lie Ala Tyr lie Lys lie Pro Asn Ala Gly Gin Met 
20 25 30 



Gin Pro Val Lys Ala Phe Lys lie His Asn Lys lie Trp Val lie Pro 
35 40 45 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg lie Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



lie Val Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145- • 150 155 160 



Asp He lie Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
180 185 190 



Asp. Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Se?: 
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245 



250 



255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu He Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys He Asn He Val Pro Lys Val 
370 375 380 



Asn Tyr Thr He Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly He He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 . 440 445 ' 



Tyr Gly Gly Phe Met Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly 
450 455 460 



Gly Ser Gly Gly Gly Gly Ser Ala Leu Val Leu Gin Cys He Lys Val 
465 470 475 480 



Asn Asn Trp Asp Leu Phe Phe Ser Pro Ser Glu Asp Asn Phe Thr Asn 
485 490 495 
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Asp Leu Asn Lys Gly Glu Glu lie Thr Ser Asp Thr Asn lie Glu Ala 
500 505 510 



Ala Glu Glu Asn He Ser Leu Asp Leu He Gin Gin Tyr Tyr Leu Thr 
515 520 525 



Phe Asn Phe Asp Asn Glu Pro Glu Asn He Ser He Glu Asn Leu Ser 
530 535 540 



Ser Asp He He Gly Gin Leu Glu Leu Met Pro Asn He Glu Arg Phe 
545 550 555 560 



Pro Asn Gly Lys Lys Tyr Glu Leu Asp Lys Tyr Thr Met Phe His Tyr 
565 570 575 



Leu Arg Ala Gin Glu Phe Glu His Gly Lys Ser Arg He Ala Leu Thr 
580 585 590 



Asn Ser Val Asn Glu Ala Leu Leu Asn Pro Ser Arg Val Tyr Thr Phe 
595 600 605 



Phe Ser Ser Asp Tyr Val Lys Lys Val Asn Lys Ala Thr Glu Ala Ala 
610 615 620 



Met Phe Leu Gly Trp Val Glu Gin Leu Val Tyr Asp Phe Thr Asp Glu 
625 630 635 640 



Thr Ser Glu Val Ser Thr Thr Asp Lys He Ala Asp He Thr He He 
645 650 655 



He Pro Tyr He Gly Pro Ala Leu Asn He Gly Asn Met Leu Tyr Lys 
660 665 670 



Asp Asp Phe Val Gly Ala Leu He Phe Ser Gly Ala Val He Leu Leu 
675 680 685 



Glu Phe He Pro Glu He Ala He Pro Val Leu Gly Thr Phe Ala Leu 
690 695 700 



Val Ser Tyr He Ala Asn Lys Val Leu Thr Val Gin Thr He Asp Asn 
705 710 715 720 



Ala Leu Ser Lys Arg Asn Glu Lys Trp Asp Glu Val Tyr Lys Tyr He 
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Val Thr Asn Trp Leu Ala Lys Val Asn Thr Gin He Asp Leu He Arg 
740 745 750 



Lys Lys Met Lys Glu Ala Leu Glu Asn Gin Ala Glu Ala Thr Lys Ala 
755 760 765 



He He Asn Tyr Gin Tyr Asn Gin Tyr Thr Glu Glu Glu Lys Asn Asn 
770 775 780 



He Asn Phe Asn He Asp Asp Leu Ser Ser Lys Leu Asn Glu Ser He 
785 790 795 800 



Asn Lys Ala Met He Asn He Asn Lys Phe Leu Asn Gin Cys Ser Val 
805 810 815 



Ser Tyr Leu Met Asn Ser Met He Pro Tyr Gly Val Lys Arg Leu Glu 
820 825 830 



Asp Phe Asp Ala Ser Leu Lys Asp Ala Leu Leu Lys Tyr He Tyr Asp 
835 840 845 



Asn Arg Gly Thr Leu He Gly Gin Val Asp Arg Leu Lys Asp Lys Val 
850 855 860 



Asn Asn Thr Leu Ser Thr Asp He Pro Phe Gin Leu Ser Lys Tyr Val 
865 870 875 880 



Asp Asn Gin Arg Leu Leu Ser Thr Leu Asp 
885 890 



<210> 49 

<211> 2751 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 49 



ggatccatgg 


agttcgttaa 


caaacagttc aactataaag acccagttaa cggtgttgac 


60 


attgcttaca 


tcaaaatccc 


gaacgctggc cagatgcagc cggtaaaggc attcaaaatc 


120 


cacaacaaaa 


tctgggttat 


cccggaacgt gataccttta ctaacccgga agaaggtgac 


180 


ctgaacccgc 


caccggaagc 


gaaacaggtg ccggtatctt actatgactc cacctacctg 


240 


tctaccgdta 


acgaaaagga 


caactacctg aaaggtgtta ctaaactgtt cgagcgtatt 


300 


tactccaccg acctgggccg 


tatgctgctg actagcatcg ttcgcggtat cccgttctgg 


360 
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v> a i» V* y a u a w 


orra a r^'h' rra a a 
^yaaw uy acta 


rtt" a a ^ /^rra ^a 

y uaauwyaCa 


li* a a w ii« y v.* <^ ^ 


wciM.v»y im wa L» w 


420 




y u L.u>^ cau^y 


^ ^ f^ci^ a fra a 
L. wwwyaay cLa 


w L.y aaww i«yy 


t"na t* r»at" c*cscs 
L.yauL.a uwyy 




480 


y Cl Iv Cl L> ^ GiU Sir v« 


ay uuuyayuy 


Km ctay ay w u u l. 


y y u a Cy aay 


u L>v>» u^y aaw w c 


("•a rTTTT^ a a o 
v#awv«v<py L>aav>» 


^ V v 


y y C li- cit-gy u u 


rr* a r» ^ r^a a 
v»^ai>* L» way L>a 


r'a ^ c^r*fi^ ^ ^ o 
u>aL«u>u>y i> !,» L*w 


uw uwL>yy a^ u 


uwawwui-wyy 


^ t" 1* h aa a cf a a 
u w L> L«y ggy gg 


w w v 


uccc uy y ady 


4- "ir^a^a /*»rta a 

uayaCauyaa 


CCCaCcy C uy 


yycyCuyy L.a 


a a r^f^f^ a a 

aau ucy Caao 


uyciUwv.* uy wy 


ODU 


4> 4> ^ ^ ^ 4* 

y tudCCCCyy 


cccacgaacc 


gau ucaugca 


^Y^V^^^^ A ^^^^^^^^^ 

ggccaccgcc 


ugcacggcac 


Cy ccatcaa^ 


/ zu 


CCyaaCCy uy 


bCCucaaagc 


A A A #*• A A 

u aac ac c aac 


AV^^^^f* A 4* ^ A ^^^^ 

gcgcaucacg 


agacgcccgg 


uccggaay uc 


/ OU 




aact^gcgtac 


t^tttggcggt 


cacgacgctza 


aabtcatcga 


cuccccgcaa 




yaaa.aCya.yC 


uccgccuy ca 


ccacuacaac 


A A ^*r^* ^ AAA 

aagc ucaaag 


acaucgcacc 


cacccugaac 


OA A 




ccatcgtggg 


t^accactgct 


tctctccagt 


acatgaagaa 


cgcuuccaaa 


o ^ n 


9daaaatiacc 


tgctcagcga 


agacacctcc 


ggcaaattct 


ctg'tagacaa 


gtm %m 4v A A A 4* ^ 

guugaaacuc 


1 AOA 


yacaaacbuu 


acaaaaugcc 


A 4* A A A 4* 4* 

gacugaaamc 


cacaccgaag 


acaacuucgu 


t- a ar<f^ 4- ^ ^ 

caagccccuu 


1 noA 


aaagttct^ga 


accgcaaaac 


ctatctgaac 


uccgacaagg 


^ A A ^ 4* A A 

caguauccaa 


A A 4* A A A 4^ 

aaccaacacc 




gt^cc^aaag 


titaactacac 


tatctacgati 


ggtttcaacc 


tigcgt.aacac 


caacctzggct. 


T O A A 


yCuaaHuCua 


acggccagaa 


cacggaaatic 


A A A A A 4" A 

aacaacacga 


A ^ ^ A A A A 

accucacaaa 


A #*t4* ^^A A A A A 

accgaaaaac 




4<* 4"* ^ jwt'W jf^t 

uuCaCuyy uC 


uguucgaguu 


ccacaagcug 


cugugcgccg 


A ^^^^0*m A 4* A 4* 

acggcaccac 


uaccuccaaa 


1 o O A 


accaaauCuc 


tgatcgaagg 


tcgttacggt 


ggtttcatga 


cctctgaaaa 


atct^cagacc 


1 O O A 




ccccgcucaa 


aaacgctatc 


atcaaaaacg 


cttiacaaaaa 


aggtgaagcg 


1 Ayi A 


ctagcgggtg 


guggtggucc 


4* ^^0*9^^ ^v^^^^ 

tggtggtggt 


ggccctggcg 


gtggtggttc 


cgcaccagcg 


1 CAA 


ctgcagtgba 


^ A A ^V^V^ ^ A A 

u c aagg u u aa 


c aac cggg ac 


ccacucuuca 


gcccgagtga 


agacaacccc 


1 C£ A 


accaacyacc 


CyaaCaaayy 


uyaayaaacc 


accucagaca 


^t4* A A A ^ tf>^ A 

ccaacaccga 


agcagc eg aa 


T O A 


!3 ^ O /*» ^ ^ /** t" 

yaaaaCauC L. 


cy c uyyaccu 


yauCCay Cay 


X^aCUaCCuya 


cccucaau u u 


^^^^ A A A ^t^V A 

cgacaacgag 


1 O A 


f^fiftfta a a a *^a 
wii^yy aaaav«a 


L> c L«L> cauwya 


aaa/^/^l"rtai*T^ 
aaai^ cy ay c 


^/^^f^at'at'r^a 
cv* uy auauwa 


ucyyccay c t. 


yyaaCuycil^y 


1 TAA 


^^y aa^a L>L»y 




a a a f^ctn^ a a a 
aaaiM>y y L»aaa 


a a a i^na rtt^ 
aay uawyay w 


t" fma a a a 4* a 
uyyawaaa ta 


u a w (.* a k> y I* t Vo 




r'ar't" ^r*r*^ rxr* 
\^cL\^ uai.#\,* uy v«> 


y v-yuy wayya 


at~t~+"<*raar'ar^ 
au L. uy aawa^ 


yy waaauwww 


y uaucy Ccic u 


/^a/^t»aa/^4*/^/^ 

yciui.aac ucc 


xo o u 


Cf i* a a r^nA a 
y u ^aa\^y aay 


(.« ww L»y Vol* waa 


wwwy uwwL>y L» 


rr^ a t" a r« a o r« i" 
y ua L>awciw w 


'h<*»t"^r»+'r»t'a#^ 
I. W L. LtW L> w L>ay 


Cyac uacy ii.y 


X U 


aaaaaggtca 


acaaagcgac 


tgaagctgca 


atgttcttgg 


gttgggttga 


acagcttgtt 


1980 


tatgatttta 


ccgacgagac 


gtccgaagta 


tctactaccg 


acaaaattgc 


ggatatcact 


2040 


atcatcatcc 


cgtacatcgg 


tccggctctg 


aacattggca 


acatgctgta 


caaagacgac 


2100 


ttcgttggcg 


cactgatctt 


ctccggtgcg 


gtgatcctgc 


tggagttcat 


cccggaaatc 


216C 
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gccatcccgg 


tactgggcac 


ctttgctctg gtttcttaca 


ttgcaaacaa 


ggttctgact 


2220 


gtacaaacca 


tcgacaacgc 


gctgagcaaa cgtaacgaaa 


aatgggatga 


agtttacaaa 


2280 


tatatcgtga 


ccaactggct 


ggctaaggtt aatactcaga 


tcgacctcat 


ccgcaaaaaa 


2340 


atgaaagaag 


cactggaaaa 


ccaggcggaa gctaccaagg 


caatcattaa 


ctaccagtac 


2400 


aaccagtaca 


ccgaggaaga 


aaaaaacaac atcaacttca 


acatcgacga 


tctgtcctct 


2460 


aaactgaacg 


aatccatcaa 


caaagctatg atcaacatca 


acaagttcct 


gaaccagtgc 


2520 


tctgtaagct 


atctgatgaa 


ctccatgatc ccgtacggtg 


ttaaacgtct 


ggaggacttc 


2580 


gatgcgtctc 


tgaaagacgc 


cctgctgaaa tacatttacg 


acaaccgtgg 


cactctgatc 


2640 


ggtcaggttg 


atcgtctgaa 


ggacaaagtg .aacaatacct 


tatcgaccga 


catccctttt 


2700 


cagctcagta 


aatatgtcga 


taaccaacgc cttttgtcca 


ctctagacta 


g 


2751 



<210> 50 
<211> 916 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Synthetic 
<400> 50 

Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
15 10 15 



Asn Gly Val Asp He Ala Tyr He Lys He Pro Asn Alai Gly Gin Met 
20 25 30 



Gin Pro Val Lys Ala Phe Lys He His Asn Lys He Trp Val He Pro 
35 40 45 ' 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg He Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



lie -yal' Arg' 'Gi^r>Xle- Pro -Phe. .Trp -Gly Gly -Ser- Thr - II© Asp-Thr Glu 
115 120 125 
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Leu Lys Val lie Asp Thr Asn Cys lie Asn Val lie Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val lie lie Gly Pro Ser Ala 
145 150 155 160 



Asp lie lie Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr lie Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu He Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
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Leu Asn Phe Asp Lys Ala Val Phe Lys lie Asn lie Val Pro Lys Val 
370 375 380 



Asn Tyr Thr He Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly He He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 440 445 



Tyr Gly Gly Phe Met Thr Ser Glu Lys Ser Gin Thr Pro Leu Val Thr 
450 455 460 



Leu Phe Lys Asn Ala He He Lys Asn Ala Tyr Lys Lys Gly Glu Ala 
465 470 475 480 



Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly 
485 490 495 , 



Ser Ala Leu Val Leu Gin Cys He Lys Val Asn Asn Trp Asp Leu Phe 
500 505 510 



Phe Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu 
515 520 525 



Glu He Thr Ser Asp Thr Asn He Glu Ala Ala Glu Glu Asn He Ser 
530 535 540 



Leu Asp Leu He Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu 
545 550 555 560 



Pro Glu Asn He Ser He Glu Asn Leu Ser Ser Asp He He Gly Gin 
565 570 575 



Leu Glu Leu Met Pro Asn He Glu Arg Phe Pro Asn Gly Lys Lys Tyr 
580 585 590 



Glu Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe 
595 600 605 
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Glu His Gly Lys Ser Arg lie Ala Leu Thr Asn Ser Val Asn Glu Ala 
610 615 620 



Leu Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val 
625 630 635 640 



Lys Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val 
. > 645 650 655 



Glu Gin Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr 
660 665 670 



Thr Asp Lys lie Ala Asp lie Thr lie He He Pro Tyr He Gly Pro 
675 680 685 



Ala Leu Asn He Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala 
690 695 700 



Leu He Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro Glu He 
705 710 715 720 



Ala He Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He Ala Asn 
725 730 735 



Lys Val Leu Thr Val Gin Thr He Asp Asn Ala Leu Ser Lys Arg Asn 
740 745 750 



Glu Lys Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp Leu Ala 
755 760 765 



Lys Val Asn Thr Gin He Asp Leu He Arg Lys Lys Met Lys Glu Ala 
770 775 780 



Leu Glu Asn Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr Gin Tyr 
785 790 795 800 



Asn Gin Tyr Thr Glu Glu Glu Lys Asn Asn He Asn Phe Asn He Asp 
805 810 815 



Asp Leu Ser Ser Lys Leu Asn Glu Ser He Asn Lys Ala Met He Asn 
820 825 830 



He Asn Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met Asn Ser 
835 840 845 



Met He Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu 
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850 855 660 

Lys Asp Ala Leu Leu Lys Tyr lie Tyr Asp Asn Arg Gly Thr Leu lie 
865 870 875 880 

Gly Gin Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr 
885 890 895 

Asp lie Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu Leu 
900 905 910 

Ser Thr Leu Asp 
915 

<210> 51 

<211> 2709 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<400> 51 



ggatccatgg 


agttcgttaa 


caaacagttc 


aactataaag 


acccagttaa 


cggtgttgac 


60 


attgcttaca 


tcaaaatccc 


gaacgctggc 


cagatgcagc 


cggtaaaggc 


attcaaaatc 


120 


cacaacaaaa 


tctgggttat 


cccggaacgt 


gataccttta 


ctaacccgga 


agaaggtgac 


180 


ctgaacccgc 


caccggaagc 


gaaacaggtg 


ccggtatctt 


actatgactc 


cacctacctg 


240 


tctaccgata 


acgaaaagga 


caactacctg 


aaaggtgtta 


ctaaactgtt 


cgagcgtatt 


300 


tactccaccg 


acctgggccg 


tatgctgctg 


actagcatcg 


ttcgcggtat 


cccgttctgg 


360 


9gcggttcta 


ccatcgatac 


cgaactgaaa 


gtaatcgaca 


ctaactgcat 


caacgttatt 


420 


cagccggacg 


gttcctatcg 


ttccgaagaa 


ctgaacctgg 


tgatcatcgg 


cccgtctgct 


480 


gatatcatcc 


agttcgagtg 


taagagcttt 


ggtcacgaag 


ttctgaacct 


cacccgtaac 


540 


ggctacggtt 


ccactcagta 


catccgtttc 


tctccggact 


tcaccttcgg 


ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


cccactgctg 


ggcgctggta 


aattcgcaac 


tgatcctgcg 


660 


gttaccctgg 


ctcacgaact 


gattcatgca 


ggccaccgcc 


tgtacggtat 


cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt 


taacaccaac 


gcgtattacg 


agatgtccgg 


tctggaagtt 


780 


agcttcgaag 


aactgcgtac 


ttttggcggt 


cacgacgcta 


aattcatcga 


ctctctgcaa 


840 


gaaaacgagt 


tccgtctgta 


ctactataac 


aagttcaaag 


atatcgcatc 


caccctgaac 


900 


aaagcgaaat 


ccatcgtggg 


taccactgct 


tctctccagt 


acatgaagaa 


cgtttttaaa 


960 


gaaaaatacc 


tgctcagcga 


agacacctcc 


ggcaaattct 


ctgtagacaa 


gttgaaattc 


1020 
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gataaacttt 


acaaaatgct 


gactgaaatt 


tacaccaaaa 


acaacttcgt 


taagttcttt 


1080 


aaagttctga 


accgcaaaac 


ctatctgaac 


ttcaacaaaa 


cagtattcaa 


aatcaacatc 


1140 


qtQCcgaaag 


ttaactacac 


tatctacgat 


ggtttcaacc 


tgcgtaacac 


caacctggct 


1200 


gctaatttta 


acggccagaa 


cacggaaatsc 


aacaacatga 


acttcacaaa 


actgaaaaac 


1260 


ttcactggtc 


tcrttcaaatt 


ttacaagc^g 


ctatacatca 
w j3 ^y ^y ^ wy 


acaacatcat 

MW^^ WM WW«A w 


tacctccaaa 


1320 


actaaatctc 


tgatagaagg 


tiagattliggc 


ggtttcacgg 


gcgcacgcaa 


atcagcgcgt 


1380 


aaacgtaaga 


accaggcgct 


aacacracaati 
o-y v-'yyy ^yy ^ 


yy v-yy uay i-.y 


y v.-yy ^yy ^yy 


*"^y **y y ^y y 


1440 


crcrcacrti aac a 


cactaatact 

ww&^q v^n^^ V 


acaatatatc 
y ^y 


cmyy ^ ^cici^ci 


ac t aaaa 1 1 1 


attcttcaac 

C4 1« w w w w ciy w 


1500 


ccgagtgaag 


acaactitcac 


caacoacctcf 


aapaaaaoi"a 

viCLwctcmy y wy 


aaaaaatcac 


et eaaat act 


1560 


aacaticcraacr 


c aac caaaoa 


aaacatctca 


ctaaapchaa 


tccaacaata 


ctacct'aacc 


1620 


ttitaattticQ 

\m W \m \m W ^ 


acaaccaacc 


aaaaaacatit 


te tat caaaa 


aeetaaacte 


taa tat catc 

i»y ci wCi bwCi ww 


1680 


aac c a.ac! tioa 


aackaatacc 


aaacatcaaa 


catttccoaa 


db^^ bGLaCLGLvL 


y uaw-yay I.* uy 


1740 


aa c a-aat!! d-ii a. 


ccatafctcca 


c t a e e t ar* err* 


cicaT'aacra a t 
y y a y y a a L. 


t i"aa a pa PCTcr 
\^ wy gctv^ci wy y 


V^CLCLCll,»VM>\«>^y L« 


1 ft no 

XO V V/ 


atcgcactga 


ctaactccati 


tiaacaaaact 


ctactcaacc 

W *»y w ^ V«UMW Vi« 


catcGcatat" 
v»y b w\rf ^y **y 


ataeaccttc 


1860 


1 1 ctzc t agcg 


actacataaa 


aaaacrtcaac 


a a aacaa eta 


aaapt'opaa^ 


y u L> w Im> *-y yy *- 


1920 


taaattoaac 


aacttatfcta 


toafctttarr* 


Clr\^C*Clr^aei Pat 


w^yocLy L«cikp^ 






aaaattaccra 


atatcactati 


cateatceco' 


taeatcaatc 


^QQC^ c toaa 

v*yy w \^ \^ k>y cia 


wm b wy y wciaw 


2040 


atoctataca 


aaaacaactt 


cat t craccrc a 
^y ^ ^y y *— y 


ctaatcttct 


^^^yy ^y *^y y ^ 


aa 'h c p i" act* cr 
y CL i» v«w wy w u«y 


2100 


aaoti ii cati c c 

^3 23 ^ V* \i* Iw4» V» %^ v* 


caaaaatcac 


c at cccaat a 


ctaaapacct 


ttactctaot 
w wy w ^y y ^ 


ttcttacatt- 


2160 


a a.a a. G a a.cra 


ttcfcaaetat 
w by ^y ^ 




oa r* a a p Of* or* 
y cn^ciciby wy w 


u y ay V* cidci wy 


wciciv»y cicicicici 


2220 


taaaataaaa 


tttacaaata 


t atiecrtcf a c c 


aaptaaftoa 
^yy te^«4 


c taaaat" t aa 


^apt paaat p 


2280 


gacctcatcc 


gcaaaaaaat 


aaaaaaaaca 


ctaaaaaacc 


aaacaaaaac 

*»y y ^y y V* 


taccaaaaca 


2340 


atcattaact 


accagtacaa 


ccagtacacc 


gaggaagaaa 


aaaacaacat " 


caacttcaac 


2400 


atcgacgatc 


tgtcctctaa 


actgaacgaa 


tccatcaaca 


aagctatgat 


caacatcaac 


2460 


aagttcctga 


accaatactc 


tafcaaactat 


ctaataaact 


ccatcat ccc 


at acaat a 1 1 


2520 


aaacgtctgg 


aggacttcga 


tgcgtctctg 


aaagacgccc 


tgctgaaata 


catttacgac 


2580 


aaccgtggca 


ctctgatcgg 


tcaggttgat 


cgtctgaagg 


acaaagtgaa 


caatacctta 


2640 


tcgaccgaca 


tcccttttca 


gctcagtaaa 


tatgtcgata 


accaacgcct 


tttgtccact 


2700 


ctagactag 












2709 
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<211> 902 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 52 

Gly. Ser. Met Glu Phe Val Asn..Lys Gin Phe Asn Tyr Lys Asp Pro Val 
15 10 15 



Asn Gly Val Asp lie Ala Tyr lie Lys lie Pro Asn Ala Gly Gin Met 
20 25 30 



Gin Pro Val Lys Ala Phe Lys lie His Asn Lys lie Trp Val lie Pro 
35 40 45 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 



Ser. Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg lie Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



lie Val Arg Gly lie Pro Phe Trp Gly Gly Ser Thr lie Asp Thr Glu 
115 120 125 



Leu Lys Val lie Asp Thr Asn Cys lie Asn Val lie Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val lie lie Gly Pro Ser Ala 
145 150 155 160 



Asp lie lie Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr lie Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 
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Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu lie His Ala Gly His Arg Leu Tyr Gly lie Ala lie Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe lie Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp lie Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



lie Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu lie Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys lie Asn lie Val Pro Lys Val 
370 375 380 



Asn Tyr Thr lie Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala- Asn Phe Asn Gly Gin Asn Thr Glu lie Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly lie lie Thr Ser Lys Thr Lys Ser Leu lie Glu Gly Arg 
435 440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Arg Lys Asn 
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450 455 460 



Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
465 470 475 480 



Gly Gly Ser Ala Leu Val Leu Gin Cys He Lys Val Asn Asn Trp Asp 
485 490 495 



Leu Phe Phe Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys 
500 505 510 



Gly Glu Glu He Thr Ser Asp Thr Asn He Glu Ala Ala Glu Glu Asn 
515 520 525 



He Ser Leu Asp Leu He Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp 
530 535 540 



Asn Glu Pro Glu Asn He Ser He Glu Asn Leu Ser Ser Asp He He 
545 550 555 560 



Gly Gin Leu Glu Leu Met Pro Asn He Glu Arg Phe Pro Asn Gly Lys 
565 570 575 



Lys Tyr Glu Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin 
580 585 590 



Glu Phe Glu His Gly Lys Ser Arg He Ala Leu Thr Asn Ser Val Asn 
595 600 605 



Glu Ala Leu Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp 
610 615 620 



Tyr Val Lys Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly 
625 630 635 640 



Trp Val Glu Gin Leu Val Tyr .Asp Phe Thr Asp Glu Thr Ser Glu Val 
645 650 655 



Ser Thr Thr Asp Lys He Ala Asp He Thr He He He Pro Tyr He 
660 665 670 



Gly Pro Ala Leu Asn He Gly Asn Met Leu Tyr Lys Asp Asp Phe Val 
675 680 685 



Gly Ala Leu He Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro 
690 695 700 
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Glu He Ala He Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He 
705 710 715 720 



Ala Asn Lys Val Leu Thr Val Gin Thr He Asp Asn Ala Leu Ser Lys 
725 730 735 



Arg Asn Glu Lys Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp 
740 745 750 



Leu Ala Lys Val Asn Thr Gin He Asp Leu He Arg Lys Lys Met Lys 
755 760 765 



Glu Ala Leu Glu Asn Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr 
770 775 780 



Gin Tyr Asn Gin Tyr Thr Glu Glu Glu Lys Asn Asn He Asn Phe Asn 
785 790 795 800 



He Asp Asp Leu Ser Ser Lys Leu Asn Glu Ser He Asn Lys Ala Met 
805 810 815 



He Asn He Asn Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met 
820 825 830 



Asn Ser Met He Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala 
835 840 845 



Ser Leu Lys Asp Ala Leu Leu Lys Tyr He Tyr Asp Asn Arg Gly Thr 
850 855 860 



Leu He Gly Gin Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu 
865 870 875 880 



Ser Thr Asp He Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg 
885 890 895 



Leu Leu Ser Thr Leu Asp 
900 



<210> 53 

<211> 2691 

<212> DNA 

<213> Artificial Sequence 



"<223> Synthetic 
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<400> 53 



ggatccatgg agttcgttaa caaacagttc aactataaag acccagttaa 


cggtgttgac 


60 


attgcttaca tcaaaatccc gaacgctggc cagatgcagc cggtaaaggc 


attcaaaatc 


120 


cacaacaaaa tctgggttat cccggaacgt gataccttta ctaacccgga 


agaaggtgac 


180 


ctgaacccgc caccggaagc gaaacaggtg ccggtatctt actatgactc 


cacctacctg 


240 


tctaccgata acgaaaagga caactacctg aaaggtgtta ctaaactgtt 


cgagcgtatt 


300 


tactccaccg acctgggccg tatgctgctg actagcatcg ttcgcggtat 


cccgttctgg 


360 


ggcggttcta ccatcgatac cgaactgaaa gtaatcgaca ctaactgcat 


caacgttatt 


420 


cagccggacg gttcctatcg ttccgaagaa ctgaacctgg tgatcatcgg 


cccgtctgct 


480 


gatatcatcc agttcgagtg taagagcttt ggtcacgaag ttctgaacct 


cacccgtaac 


540 


ggctacggtt ccactcagta catccgtttc tctccggact tcaccttcgg 


ttttgaagaa 


600 


tccctggaag tagacacgaa cccactgctg ggcgctggta aattcgcaac 


tgatcctgcg 


660 


gttaccctgg ctcacgaact gattcatgca ggccaccgcc tgtacggtat 


cgccatcaat 


720 


ccgaaccgtg tcttcaaagt taacaccaac gcgtattacg agatgtccgg 


tctggaagtt 


780 


agcttcgaag aactgcgtac ttttggcggt cacgacgcta aattcatcga 


ctctctgcaa 


840 


gaaaacgagt tccgtctgta ctactataac aagttcaaag atatcgcatc 


caccctgaac 


900 


aaagcgaaat ccatcgtggg taccactgct tctctccagt acatgaagaa 


cgtttttaaa 


960 


gaaaaatacc tgctcagcga agacacctcc ggcaaattct ctgtagacaa 


gttgaaattc 


1020 


gataaacttt acaaaatgct gactgaaatt tacaccgaag acaacttcgt 


taagttcttt 


1080 


aaagttctga accgcaaaac ctatctgaac ttcgacaagg cagtattcaa 


aatcaacatc 


1140 


gtgccgaaag ttaactacac tatctacgat ggtttcaacc tgcgtaacac 


caacctggct 


1200 


gctaatttta acggccagaa cacgg^aatc aacaacatga acttcacaaa 


actgaaaaac 


1260 


ttcactggtc tgttcgagtt ttacaagctg ctgtgcgtcg acggcatcat 


tacctccaaa 


1320 


actaaatctc tgatagaagg tagatttggc ggtttcacgg gcgcacgcaa 


atcagcggcg 


1380 


ctagcgggcg gtggcggtag cggcggtggc ggtagcggcg gtggcggtag 


cgcactagtg 


1440 


ctgcagtgta tcaaggttaa caactgggat ttattcttca gcccgagtga 


agacaacttc 


1500 


accaacgacc tgaacaaagg tgaagaaatc acctcagata ct'aacatcga 


agcagccgaa 


1560 


gaaaacatct cgctggacct gatccagcag tactacctga cctttaattt 


cgacaacgag 


1620 


ccggaaaaca tttctatcga aaacctgagc tctgatatca tcggccagct 


ggaactgatg 


1680 


ccgaacatcg aacgtttccc aaacggtaaa aagtacgagc tggacaaata 


taccatgttc 


1740 


' cactacctgc gcgcgcagga atttgaacac ggcaaatccc gtatcgcact 


gactaactcc 


1800 
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ct pt crr'i"f*aa 


f n r* crt" r* c ccf t 

V«p O V» V«» V* J Im> 


cr1"atar«ar»r«t ♦■r»t"tr»t f tacf 


fCf a c t ar»Cf t cr 


X o o u 


a. a. a. a aaa ti c! a. 




t"Cfaacrptaca 

Cloy v»» 


^ V- L- uyy yuuyyyi-i*yw 


acacrnt tcit t 


X ^ u 


t all aali t ti ti a 


^ W ^ W VL 


at ccaaaota 


tctaotarTTT afaaaattoc 
isdiv^y d^dddd i»» wy ^ 


aaatatcact 


^ J \j \j 


a teat" cat* t^c 


naf" a r»a t* naa 


t f» r^oar* t f* t cr 

U v» ^ w i« 


dd^di* wyy w>d dL>duy^i«y wd 


w« dddy dw>y dw 




1 1 ocrt" i" anna 


f»a n cf a t r* t ^ 




y uy d c v«L. uy uyycty L.i«^dL. 


^ y y ddd u 1^ 


91 Oft 


accatccf^crci 

>n W w W ^ W w V4 ^ 






yuucouudwd uuy VodddKodd 


rtnt t ^ t rT54 ^t 


^ xo v/ 


^ci^ciciau«L>ci 




ycugagcddd 


Cy uddCyddd ddugyydCgd 


a < n t ^ 4* a #^ a a a 

agt^cacaaa 




54 f* J* 4" r«nt" rra 


C C dd\J L.^^C 


y y c u ddgg u u 


ddUdCuCdga ucgdCCuCdu 


ccgcaaddad 


oo ft n 


Cl ClClCl^CLCt^ 


L>dL> L.^^dddd 


CCdy^Cyyao. 


a a a r^a94*^a4*4'aa 

yccdCCdagg caaucauuaa 


ccaccaguac 




d ct V* L. l« etc a 


CC^d^^ddgd 


daddadCdaC 


aucaacucca acaucgacga 


uc uguccucu 


OA Art 




dduwCdUCdd 


CdddyCUdug 


auCdacauca acaagcuccu 


gaaccagtgc 




tctgtaagct 


atctgatgaa 


ctccatgatc 


ccgtacggtg ttaaacgtct 


ggaggacttc 


2520 


gatgcgtctc 


tgaaagacgc 


cctgctgaaa 


tacatttacg acaaccgtgg 


cactctgatc 


2580 


ggtcaggttg 


atcgtctgaa 


ggacaaagtg 


aacaatacct tatcgaccga 


catccctttt 


2640 


cagctcagta 


aatatgtcga 


taaccaacgc 


cttttgtcca ctctagacta 


g 


2691 



<210> 54 
<211> 896 
<212> PRT 

<213>. Artificial Sequence 
<220> 

<223> Synthetic 
<400> 54 

Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
15 10 15 

Asn Gly Val Asp lie Ala Tyr He Lys He Pro Asn Ala Gly Gin Met 
20 25 30 

Gin Pro Val Lys Ala Phe Lys He His Asn Lys He Trp Val He Pro 
35 40 45 

Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 . 70 75 , 80 
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Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg lie Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



lie Val Arg Gly lie Pro Phe Trp Gly Gly Ser Thr lie Asp Thr Glu 
115 120 125 



Leu Lys Val lie Asp Thr Asn Cys lie Asn Val lie Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val lie lie Gly Pro Ser Ala 
145 150 155 160 



Asp lie lie Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr lie Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 
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Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu lie Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 

. . 355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys He Asn lie Val Pro Lys Val 
370 375 380 



Asn Tyr Thr He Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly He He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Ala Leu Ala Gly Gly 
450 455 460 



Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu Val 
465 470 475 480 



Leu Gin Cys He Lys Val Asn Asn Trp Asp Leu Phe Phe Ser Pro Ser 
485 490 495 



Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu He Thr Ser 
500 505 510 



Asp Thr Asn He Glu Ala Ala Glu Glu Asn He Ser Leu Asp Leu He 
515 520 525 



Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro Glu Asn He 
530 535 540 



Ser He Glu Asn Leu Ser Ser Asp He He Gly Gin Leu Glu Leu Met 
545 550 555 560 



Pro Asn He Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu Leu Asp Lys 
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565 



570 



575 



Tyr Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu His Gly Lys 
580 585 590 



Ser Arg lie Ala Leu Thr Asn Ser Val Asn Glu Ala Leu Leu Asn Pro 
595 600 605 



Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys Lys Val Asn 
610 615 620 



Lys Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu Gin Leu Val 
625 630 635 640 



Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr Asp Lys He 
645 650 655 



Ala Asp He Thr He He He Pro Tyr He Gly Pro Ala Leu Asn He 
660 665 670 



Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu He Phe Ser 
675 680 685 



Gly Ala Val He Leu Leu Glu Phe He Pro Glu He Ala He Pro Val 
690 695 700 



Leu Gly Thr Phe Ala Leu Val Ser Tyr He Ala Asn Lys Val Leu Thr 
705 710 715 720 



Val Gin Thr He Asp Asn Ala Leu Ser Lys Arg Asn Glu Lys Trp Asp 
725 730 735 



Glu Val Tyr Lys Tyr He Val Thr Asn Trp Leu Ala Lys Val Asn Thr 
740 745 750 



Gin He Asp Leu He Arg Lys Lys Met Lys Glu Ala Leu Glu Asn Gin 
755 760 765 



Ala Glu Ala Thr Lys Ala He He Asn Tyr Gin Tyr Asn Gin Tyr Thr 
770 775 780 



Glu Glu Glu Lys Asn Asn He Asn Phe Asn He Asp Asp Leu Ser Ser 
785 790 795 800 



Lys Leu Asn Glu Ser He Asn Lys Ala Met He Asn He Asn Lys Phe 
805 810 815 
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89 



Leu Asn Gin Cys Ser Val Ser Tyr Leu Met Asn 
820 825 



Ser Met He Pro Tyr 
830 



Gly Val Lys Arg Leu Glu T^p Phe Asp Ala Ser 
835 840 



Leu Lys Asp Ala Leu 
845 



Leu Lys Tyr He Tyr Asp Asn Arg Gly Thr Leu 
850 855 



He Gly Gin Val Asp 
860 



Arg Leu Lys Asp Lys Val Asn Asn Thr Leu Ser 
865 870 875 



Thr Asp He Pro Phe 
880 



Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu 
885 890 



Leu Ser Thr Leu Asp 
895 



<210> 55 

<211> 2691 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 55 



ggatccatgg 


agttcgttaa 


caaacagttc 


aactataaag 


acccagttaa 


cggtgttgac 


60 


attgcttaca 


tcaaaatccc 


gaacgctggc 


cagatgcagc 


cggtaaaggc 


attcaaaatc 


120 


cacaacaaaa 


tctgggttat 


cccggaacgt 


gataccttta 


ctaacccgga 


agaaggtgac 


180 


ctgaacccgc 


caccggaagc 


gaaacaggtg 


ccggtatctt 


actatgactc 


cacctacctg 


240 


tctaccgata 


acgaaaagga 


caactacctg 


aaaggtgtta 


ctaaactgtt 


cgagcgtatt 


300 


tactccaccg 


acctgggccg 


tatgctgctg 


actagcatcg 


ttcgcggtat 


cccgttctgg 


360 


srgcggttcta 


ccatcgatac 


cgaactgaaa 


gtaatcgaca 


ctaactgcat 


caacgttatt 


420 


cagccggacg 


gttcctatcg 


ttccgaagaa 


ctgaacctgg 


tgatcatcgg 


cccgtctgct 


480 


gatatcatcc 


agttcgagtg 


taagagcttt 


ggtcacgaag 


ttctgaacct 


cacccgtaac 


540 


ggctacggtt 


ccactcagta 


catccgtttc 


tctccggact 


tcaccttcgg 


ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


cccactgctg 


ggcgctggta 


aattcgcaac 


tgatcctgcg 


660 


gttaccctgg 


ctcacgaact 


gattcatgca 


ggccaccgcc 


tgtacggtat 


cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt 


taacaccaac 


gcgtattacg 


agatgtccgg 


tctggaagtt 


780 


agcttcgaag 


aactgcgtac 


ttttggcggt 


cacgacgcta 


aattcatcga 


ctctctgcaa 


840 


igaaaacgagt 


tGcgtctgta 


ctactataac 


•aagttcaaag 


atatcgcatc 


caccctgaac 


900 
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aaagcgaaat ccatcgtggg taccactgct tctctccagt acatgaagaa cgtttttaaa 960 

gaaaaatacc tgctcagcga agacacctcc ggcaaattct ctgtagacaa gttgaaattc 1020 

gataaacttt acaaaatgct gactgaaatt tacaccgaag acaacttcgt taagttcttt 1080 

aaagttctga accgcaaaac ctatctgaac ttcgacaagg cagtattcaa aatcaacatc 1140 

gtgccgaaag ttaactacac tatctacgat ggtttcaacc tgcgtaacac caacctggct 1200 

gctaatttta acggccagaa cacggaaatc aacaacatga acttcacaaa actgaaaaac 1260 

ttcactggtc tgttcgagtt ttacaagctg ctgtgcgtcg acggcatcat tacctccaaa 1320 

actaaatctc tgatagaagg tagatttggc ggtttcacgg gcgcacgcaa atatgcggcg 1380 

ctagcgggcg gtggcggtag cggcggtggc ggtagcggcg gtggcggtag cgcactagtg 1440 

ctgcagtgta tcaaggttaa caactgggat ttattcttca gcccgagtga agacaacttc 1500 

accaacgacc tgaacaaagg tgaagaaatc acctcagata ctaacatcga agcagccgaa 1560 

gaaaacatct cgctggacct gatccagcag tactacctga cctttaattt cgacaacgag 1620 

ccggaaaaca tttctatcga aaacctgagc tctgatatca tcggccagct ggaactgatg 1680 

ccgaacatcg aacgtttccc aaacggtaaa aagtacgagc tggacaaata taccatgttc 1740 

cactacctgc gcgcgcagga atttgaacac ggcaaatccc gtatcgcact gactaactcc 1800 

gttaacgaag ctctgctcaa cccgtcccgt gtatacacct tcttctc.tag cgactacgtg 1860 

aaaaaggtca acaaagcgac tgaagctgca atgttcttgg gttgggttga acagcttgtt 1920 

. tatgatttta ccgacgagac gtccgaagta tctactaccg acaaaattgc ggatatcact 1980 

at cat cat cc cgtacatcgg tccggctctg aacattggca acatgctgta caaagacgac 2040 

ttcgttggcg cactgatctt ctccggtgcg gtgatcctgc tggagttcat cccggaaatc 2100 

gc.catcccgg tactgggcac ctttgctctg gtttcttaca ttgcaaacaa ggttctgact 2160 

gtacaaacca tcgacaacgc gctgagcaaa cgtaacgaaa aatgggatga agtttacaaa 2220 

tatatcgtga ccaactggct ggctaaggtt aatactcaga tcgacctcat ccgcaaaaaa 2280 

atgaaagaag cactggaaaa ccaggcggaa gctaccaagg caatcattaa ctaccagtac 2340 

aaccagtaca ccgaggaaga aaaaaacaac atcaacttca acatcgacga tctgtcctct 2400 

aaactgaacg aatccatcaa caaagctatg atcaacatca acaagttcct gaaccagtgc 2460 

tctgtaagct atctgatgaa ctccatgatc ccgtacggtg ttaaacgtct ggaggacttc 2520 

gatgcgtctc tgaaagacgc cctgctgaaa tacatttacg acaaccgtgg cactctgatc 2580 

ggtcaggttg atcgtctgaa ggacaaagtg aacaatacct tatcgaccga catccctttt 2640 

cagctcagta aatatgtcga taaccaacgc cttttgtcca ctctagacta g 2691 
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<210> 56 
<211> 896 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 56 

Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
15 10 15 



Asn Gly Val Asp He Ala Tyr He Lys He Pro Asn Ala Gly Gin Met 
20 25 30 



Gin Pro Val Lys Ala Phe Lys He His Asn Lys He Trp Val He Pro 
35 40 45 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg He Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



He Val Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 • 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 . 



wo 2006/059105 



92 



PCT/GB2005/004598 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu He Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys He Asn He Val Pro Lys Val 
370 375 380 



Asn Tyr Thr He Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly He He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 440 445 
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Phe Gly Gly Phe Thr Gly Ala Arg Lys Tyr Ala Ala Leu Ala Gly Gly 
450 455 460 



Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu Val 
465 470 475 480 



Leu Gin Cys He Lys Val Asn Asn Trp Asp Leu Phe Phe Ser Pro Ser 
485 490 495 



Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu He Thr Ser 
500 505 510 



Asp Thr Asn He Glu Ala Ala Glu Glu Asn He Ser Leu Asp Leu He 
515 520 525 



Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro Glu Asn He 
530 535 540 



Ser He Glu Asn Leu Ser Ser Asp He He Gly Gin Leu Glu Leu Met 
545 550 555 560 



Pro Asn He Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu Leu Asp Lys 
565 570 575 



Tyr Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu His Gly Lys 
580 585 590 



Ser Arg He Ala Leu Thr Asn Ser Val Asn Glu Ala Leu Leu Asn Pro 
595 600 605 



Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys Lys Val Asn 
610 615 620 



Lys Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu Gin Leu Val 
625 630 635 640 



Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr Asp Lys He 
645 650 655 



Ala Asp He Thr He He He Pro Tyr He Gly Pro Ala Leu Asn He 
660 665 670 



Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu He Phe Ser 
675 680 685 



Gly Ala Val He Leu Leu Glu Phe He Pro Glu He Ala He Pro Val 
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690 695 700 



Leu Gly Thr Phe Ala Leu Val Ser Tyr He Ala Asn Lys Val Leu Thr 
705 710 715 720 



Val Gin Thr He Asp Asn Ala Leu Ser Lys Arg Asn Glu Lys Trp Asp 
725 730 735 



Glu Val Tyr Lys Tyr He Val Thr Asn Trp Leu Ala Lys Val Asn Thr 
740 745 750 



Gin He Asp Leu He Arg Lys Lys Met Lys Glu Ala Leu Glu Asn Gin 
755 760 765 



Ala Glu Ala Thr Lys Ala He He Asn Tyr Gin Tyr Asn Gin Tyr Thr 
770 775 780 



Glu Glu Glu Lys Asn Asn He Asn Phe Asn He Asp Asp Leu Ser Ser 
785 790 795 800 



Lys Leu Asn Glu Ser He Asn Lys Ala Met He Asn He Asn Lys Phe 
805 810 815 



Leu Asn Gin Cys Ser Val Ser Tyr Leu Met Asn Ser Met He Pro Tyr 
820 825 830 



Gly Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys Asp Ala Leu 
835 840 845 



Leu Lys Tyr He Tyr Asp Asn Arg Gly Thr Leu He Gly Gin Val Asp 
850 855 860 



Arg Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp He Pro Phe 
865 870 875 880 



Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu Leu Ser Thr Leu Asp 
885 890 895 



<210> 57 

<211> 2691 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 57 

ggatccatgg agttcgttaa caaacagttc aactataaag acccagttaa cggtgttgac 60 
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attgcttaca tcaaaatccc gaacgctggc cagatgcagc cggtaaaggc attcaaaatc 120 

cacaacaaaa tctgggttat cccggaacgt gataccttta ctaacccgga agaaggtgac 180 

ctgaacccgc caccggaagc gaaacaggtg ccggtatctt actatgactc cacctacctg 240 

tctaccgata acgaaaagga caactacctg aaaggtgtta ctaaactgtt cgagcgtatt 300 

tactccaccg acctgggccg tatgctgctg actagcatcg ttcgcggtat cccgttctgg 360 

ggcggttcta ccatcgatac cgaactgaaa gtaatcgaca ctaactgcat caacgttatt 420 

cagccggacg gttcctatcg ttccgaagaa ctgaacctgg tgatcatcgg cccgtctgct 480 

gatatcatcc agttcgagtg taagagcttt ggtcacgaag ttctgaacct cacccgtaac 540 

ggctacggtt ccactcagta catccgtttc tctccggact tcaccttcgg ttttgaagaa 600 

tccctggaag tagacacgaa cccactgctg ggcgctggta aattcgcaac tgatcctgcg 660 

gttaccctgg ctcacgaact gattcatgca ggccaccgcc tgtacggtat cgccatcaat 720 

ccgaaccgtg tcttcaaagt taacaccaac gcgtattacg agatgtccgg tctggaagtt 780 

agcttcgaag aactgcgtac ttttggcggt cacgacgcta aattcatcga ctctctgcaa 840 

gaaaacgagt tccgtctgta ctactataac aagttcaaag atatcgcatc caccctgaac 900 

aaagcgaaat ccatcgtggg taccactgct tctctccagt acatgaagaa cgtttttaaa 960 

gaaaaatacc tgctcagcga agacacctcc ggcaaattct ctgtagacaa gttgaaattc 1020 

gataaacttt acaaaatgct gactgaaatt tacaccgaag acaacttcgt taagttcttt 1080 

aaagttctga accgcaaaac ctatctgaac ttcgacaagg cagtattcaa aatcaacatc 1140 

gtgccgaaag ttaactacac tatctacgat ggtttcaacc tgcgtaacac caacctggct 1200 

gctaatttta acggccagaa cacggaaatc aacaacatga acttcacaaa actgaaaaac 1260 

ttcactggtc tgttcgagtt ttacaagctg ctgtgcgtcg acggcatcat tacctccaaa 1320 

actaaatctc tgatagaagg tagatttggc ggtttcacgg gcgcacgcaa atcatatgcg 1380 

ctagcgggcg gtggcggtag cggcggtggc ggtagcggcg gtggcggtag cgcactagtg 1440 

ctgcagtgta tcaaggttaa caactgggat ttattcttca gcccgagtga agacaacttc 1500 

accaacgacc tgaacaaagg tgaagaaatc acctcagata ctaacatcga agcagccgaa 1560 

gaaaacatct cgctggacct gatccagcag tactacctga cctttaattt cgacaacgag 1620 

ccggaaaaca tttctatcga aaacctgagc tctgatatca tcggccagct ggaactgatg 1680 

ccgaacatcg aacgtttccc aaacggtaaa aagtacgagc tggacaaata taccatgttc 1740 

cactacctgc gcgcgcagga atttgaacac ggcaaatccc gtatcgcact gactaactcc 1800 

gttaacgaag ctctgctcaa cccgtcccgt gtatacacct tcttctctag cgactacgtg 1860 
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aaaaaggtca 


acaaagcgac 


tgaagctgca atgttcttgg gttgggttga acagcttgtt 


1920 


tatgatttta 


ccgacgagac 


gtccgaagta tctactaccg acaaaattgc ggatatcact 


1980 


atcatcatcc 


cgtacatcgg 


tccggctctg aacattggca acatgctgta caaagacgac 


2040 


ttcgttggcg 


cactgatctt 


ctccggtgcg gtgatcctgc tggagttcat cccggaaatc 


2100 


gccatcccgg 


tactgggcac 


ctttgctctg gtttcttaca ttgcaaacaa ggttctgact 


2160 


gtacaaacca 


tcgacaacgc 


gctgagcaaa cgtaacgaaa aatgggatga agtttacaaa 


2220 


tatatcgtga 


ccaactggct 


ggctaaggtt aatactcaga tcgacctcat ccgcaaaaaa 


2280 


atgaaagaag 


cactggaaaa 


ccaggcggaa gctaccaagg caatcattaa ctaccagtac 


2340 


aaccagtaca 


ccgaggaaga 


aaaaaacaac atcaacttca acatcgacga tctgtcctct 


2400 


aaactgaacg 


aatccatcaa 


caaagctatg atcaacatca acaagttcct gaaccagtgc 


2460 


tctgtaagct 


atctgatgaa 


ctccatgatc ccgtacggtg ttaaacgtct ggaggacttc 


2520 


gatgcgtctc 


tgaaagacgc 


cctgctgaaa tacatttacg acaaccgtgg cactctgatc 


2580 


ggtcaggttg 


atcgtctgaa 


ggacaaagtg aacaatacct tatcgaccga catccctttt 


2640 


cagctcagta 


aatatgtcga 


taaccaacgc cttttgtcca ctctagacta g 


2691 



<210> 58 
<211> 896 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 58 

Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
15 10 15 



Asn Gly Val Asp lie Ala Tyr He Lys He Pro Asn Ala Gly Gin Met 
20 25 30 



Gin Pro Val Lys Ala Phe Lys He His Asn Lys He Trp Val He Pro 
35 40 45 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Tlir Tyr Leu 
65 70 75 80 



Ser-- Thr Asp Asn Glu , Lys Asp^Asn Ty.r-,L(eu,.Jbys; Gly_Val- .Thr .&ys Leu 
85 90 ' " 95 
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Phe Glu Arg lie Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



He Val Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 

- - -325. . , z.r.C 'r-/- 33^ ^.sl , 335 
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Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu He Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys He Asn He Val Pro Lys Val 
370 375 380 



Asn Tyr Thr He Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly He He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Tyr Ala Leu Ala Gly Gly 
450 455 460 



Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu Val 
465 470 475 480 



Leu Gin Cys He Lys Val Asn Asn Trp Asp Leu Phe Phe Ser Pro Ser 
485 490 495 



Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu He Thr Ser 
500 505 510 



Asp Thr Asn He Glu Ala Ala Glu Glu Asn He Ser Leu Asp Leu He 
515 520 525 



Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro Glu Asn He 
530 535 540 



Ser He Glu Asn Leu Ser Ser Asp He He Gly Gin Leu Glu Leu Met 
545 550 555 560 



Pro Asn He Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu Leu Asp Lys 
565 570 575 
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Tyr Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu His Gly Lys 
580 585 590 



Ser Arg lie Ala Leu Thr Asn Ser Val Asn Glu Ala Leu Leu Asn Pro 
595 600 605 



Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys Lys Val Asn 

. .610 615 620 



Lys Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu Gin Leu Val 
625 630 635 640 



Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr Asp Lys lie 
645 650 655 



Ala Asp lie Thr lie lie lie Pro Tyr lie Gly Pro Ala Leu Asn lie 
660 665 670 



Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu lie Phe Ser 
675 680 685 



Gly Ala Val lie Leu Leu Glu Phe lie Pro Glu lie Ala He Pro Val 
690 695 700 



Leu Gly Thr Phe Ala Leu Val Ser Tyr He Ala Asn Lys Val Leu Thr 
705 710 715 720 



Val Gin Thr He Asp Asn Ala Leu Ser Lys Arg Asn Glu Lys Trp Asp 
725 730 735 



Glu Val Tyr Lys Tyr He Val Thr Asn Trp Leu Ala Lys Val Asn Thr 
740 745 750 



Gin He Asp Leu He Arg Lys Lys Met Lys Glu Ala Leu Glu Asn Gin 
755 760 765 



Ala. Glu Ala Thr Lys Ala He He Asn Tyr Gin Tyr Asn Gin Tyr Thr 
770 775 780 



Glu Glu Glu Lys Asn Asn He Asn Phe Asn He Asp Asp Leu Ser Ser 
785 790 795 800 



Lys Leu Asn Glu Ser He Asn Lys Ala Met He Asn He Asn Lys Phe 
805 810 815 



Leu Asn Gin Cys Ser Val Ser Tyr Leu Met Asn Ser Met He Pro Tyr 
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820 825 830 

Gly Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys Asp Ala Leu 
835 840 845 

Leu Lys Tyr He Tyr Asp Asn Arg Gly Thr Leu He Gly Gin Val Asp 
850 855 860 

Arg Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp He Pro Phe 
865 870 875 880 

Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu Leu Ser Thr Leu Asp 
885 890 895 

<210> 59 
<211> 2709 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<400> 59 



ggatccatgg 


agttcgttaa 


caaacagttc aactataaag acccagttaa cggtgttgac 


60 


attgcttaca 


tcaaaatccc 


gaacgctggc cagatgcagc cggtaaaggc attcaaaatc 


120 


cacaacaaaa 


tctgggttat 


cccggaacgt gataccttta ctaacccgga agaaggtgac 


180 


ctgaacccgc 


caccggaagc 


gaaacaggtg ccggtatctt actatgactc cacctacctg 


240 


tctaccgata 


acgaaaagga 


caactacctg aaaggtgtta ctaaactgtt cgagcgtatt 


300 


tactccaccg 


acctgggccg 


tatgctgctg actagcatcg ttcgcggtat cccgttctgg 


360 


ggcggttcta 


ccatcgatac 


cgaactgaaa gtaatcgaca ctaactgcat caacgttatt 


420 


cagccggacg 


gttcctatcg 


ttccgaagaa ctgajacctgg tgatcatcgg cccgtctgct 


480 


gatatcatcc 


agttcgagtg 


taagagcttt ggtcacgaag ttctgaacct cacccgtaac 


540 


ggctacggtt 


ccactcagta 


catccgtttc tctccggact tcaccttcgg ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


cccactgctg ggcgctggta aattcgcaac tgatcctgcg 


660 


gttaccctgg 


ctcacgaact 


gattcatgca ggccaccgcc tgtacggtat cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt 


taacaccaac gcgtattacg agatgtccgg tctggaagtt 


780 


agcttcgaag 


aactgcgtac 


ttttggcggt cacgacgcta aattcatcga ctctctgcaa 


840 


gaaaacgagt 


tccgtctgta 


ctactataac aagttcaaag atatcgcatc caccctgaac 


900 


aaagcgaaat 


ccatcgtggg 


taccactgct tctctccagt acatgaagaa cgtttttaaa 


960 


gaaaaatacc 


tgctcagcga 


agacacctcc ggcaaattct ctgtagacaa gttgaaattc 


1020 
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gataaacttt 


acaaaatgct 


gactgaaatt 


tacaccgaag 


acaacttcgt 


taagttcttt 


1080 


aaagttctga 


accgcaaaac 


ctatctgaac 


ttcgacaagg 


cagtattcaa 


aatcaacatc 


1140 


gtgccgaaag 


ttaactacac 


tatctacgat 


ggtttcaacc 


tgcgtaacac 


caacctggct 


1200 


gctaacutca 


acggccagaa 


cacggaaatc 


aacaacatga 


acttcacaaa 


actgaaaaac 


1260 


ttcactggtc 


tgttcgagtt 


ttacaagctg 


ctgtgcgtcg 


acggcatcat 


tacctccaaa 


1320 


actaaatctc 


tgatagaagg 


tagatttggc 


ggtttcacgg 


gcgcacgcaa 


atcagcgcgt 


1380 


aaatatgcta 


accaggcgct 


agcgggcggt 


ggcggtagcg 


gcggtggcgg 


tagcggcggt 


1440 


ggcggtagcg 


cactagtgct 


gcagtgtatc 


aaggttaaca 


actgggattt 


attcttcagc 


1500 


ccgagtgaag 


acaacttcac 


caacgacctg 


aacaaaggtg 


aagaaatcac 


ctcagatact 


1560 


aacatcgaag 


cagccgaaga 


aaacatctcg 


ctggacctga 


tccagcagta 


ctacctgacc 


1620 


tttaatttcg 


acaacgagcc 


ggaaaacatt 


tctatcgaaa 


acctgagctc 


tgatatcatc 


1680 


ggccagctgg 


aactgatgcc 


gaacatcgaa 


cgtttcccaa 


acggtaaaaa 


gtacgagctg 


1740 


gacaaatata 


ccatgttcca 


ctacctgcgc 


gcgcaggaat 


ttgaacacgg 


caaatcccgt 


1800 


atcgcactga 


ctaactccgt 


taacgaagct 


ctgctcaacc 


cgtcccgtgt 


atacaccttc 


1860 


ttctctagcg 


actacgtgaa 


aaaggtcaac 


aaagcgactg 


aagctgcaat 


gttcttgggt 


1920 


tgggttgaac 


agcttgttta 


tgattttacc 


gacgagacgt 


ccgaagtatc 


tactaccgac 


1980 


aaaattgcgg 


atatcactat 


catcatcccg 


tacatcggtc 


cggctctgaa 


cattggcaac 


2040 


atgctgtaca 


aagacgactt 


cgttggcgca 


ctgatcttct 


ccggtgcggt 


gatcctgctg 


2100 


gagttcatcc 


cggaaatcgc 


catcccggta 


ctgggcacct 


ttgctctggt 


ttcttacatt 


2160 


gcaaacaagg 


ttctgactgt 


acaaaccatc 


gacaacgcgc 


tgagcaaacg 


taacgaaaaa 


2220 


tgggatgaag 


tttacaaata 


tatcgtgacc 


aactggctgg 


ctaaggttaa 


tactcagatc 


2280 


gacctcatcc 


gcaaaaaaat 


gaaagaagca 


ctggaaaacc 


aggcggaagc 


taccaaggca 


2340 


atcattaact 


accagtacaa 


ccagtacacc 


gaggaagaaa 


aaaacaacat 


caacttcaac 


2400 


atcgacgatc 


tgtcctctaa 


actgaacgaa 


tccatcaaca 


aagctatgat 


caacatcaac 


2460 


aagttcctga 


accagtgctc 


tgtaagctat 


ctgatgaact 


ccatgatccc 


gtacggtgtt 


2520 


aaacgtctgg 


aggacttcga 


tgcgtctctg 


aaagacgccc 


tgctgaaata 


catttacgac 


2580 


aaccgtggca 


ctctgatcgg 


tcaggttgat 


cgtctgaagg 


acaaagtgaa 


caatacctta 


2640 


tcgaccgaca 


tcccttttca 


gctcagtaaa 


tatgtcgata 


accaacgcct 


tttgtccact 


2700 


ctagactag 












2709 



<210> 60 
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<211> 902 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 60 

Gly Ser Met Glu. Phe Val.Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
15 10 15 



Asn Gly Val Asp lie Ala Tyr He Lys He Pro Asn Ala Gly Gin Met 
20 25 30 



Gin Pro Val Lys Ala Phe Lys He His Asn Lys He Trp Val He Pro 
35 40 45 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg He Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



He Val Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 
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Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu lie His Ala Gly His Arg Leu Tyr Gly lie Ala lie Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu He Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys . Ala Val Phe Lys He Asn He Val Pro Lys Val 
370 375 380 



Asn Tyr Thr He Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly He He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Tyr Ala Asn 
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450 455 460 



Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
465 470 475 480 



Gly Gly Ser Ala Leu Val Leu Gin Cys He Lys Val Asn Asn Trp Asp 
485 490 495 



Leu Phe Phe Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys 
500 505 510 



Gly Glu Glu He Thr Ser Asp Thr Asn He Glu Ala Ala Glu Glu Asn 
515 520 525 



He Ser Leu Asp Leu He Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp 
530 535 540 



Asn Glu Pro Glu Asn He Ser He Glu Asn Leu Ser Ser Asp He He 
545 550 555 560 



Gly Gin Leu Glu Leu Met Pro Asn He Glu Arg Phe Pro Asn Gly Lys 
565 570 575 



Lys Tyr Glu Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin 
580 585 590 



Glu Phe Glu His Gly Lys Ser Arg He Ala Leu Thr Asn Ser Val Asn 
595 600 605 



Glu Ala Leu Leu Asn Pro Ser Val Tyr Thr Phe Phe Ser Ser Asp 
610 615 620 



Tyr Val Lys Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly 
625 630 635 640 



Trp Val Glu Gin Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val 
645 650 655 



Ser Thr Thr Asp Lys He Ala Asp He Thr He He He Pro Tyr He 
660 665 670 



Gly Pro Ala Leu Asn He Gly Asn Met Leu Tyr Lys Asp Asp Phe Val 
675 680 685 



Gly Ala Leu He Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro 
690 695 700 
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Glu lie Ala lie Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr lie 
705 710 715 720 



Ala Asn Lys Val Leu Thr Val Gin Thr lie Asp Asn Ala Leu Ser Lys 
725 730 735 



Arg Asn Glu Lys Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp 
740 745 750 



Leu Ala Lys Val Asn Thr Gin He Asp Leu He Arg Lys Lys Met Lys 
755 760 765 



Glu Ala Leu Glu Asn Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr 
770 775 780 



Gin Tyr Asn Gin Tyr Thr Glu Glu Glu Lys Asn Asn He Asn Phe Asn 
785 790 795 800 



He Asp Asp Leu Ser Ser Lys Leu Asn Glu Ser He Asn Lys Ala Met 
805 810 815 



He Asn He Asn Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met 
820 825 830 



Asn Ser Met He Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala 
835 840 845 



Ser Leu Lys Asp Ala Leu Leu Lys Tyr He Tyr Asp Asn Arg Gly Thr 
850 855 860 



Leu He Gly Gin Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu 
865 870 875 880 



Ser Thr Asp He Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg 
885 890 895 



Leu Leu Ser Thr Leu Asp 
900 



<210> 61 

<211> 2697 

<212> DNA 

<213> Artificial Sequence 

<220.> . . . 

<223> Synthetic"* 
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<400> 61 

ggatccatgg agttcgttaa caaacagttc aactataaag acccagttaa cggtgttgac 60 

attgcttaca tcaaaatccc gaacgctggc cagatgcagc cggtaaaggc attcaaaatc 120 

cacaacaaaa tctgggttat cccggaacgt gataccttta ctaacccgga agaaggtgac 180 

ctgaacccgc caccggaagc gaaacaggtg ccggtatctt actatgactc cacctacctg 240 

tctaccgata acgaaaagga caactacctg aaaggtgtta ctaaactgtt cgagcgtatt 300 

tactccaccg acctgggccg tatgctgctg actagcatcg ttcgcggtat cccgttctgg 360 

ggcggttcta ccatcgatac cgaactgaaa gtaatcgaca ctaactgcat caacgttatt 420 

cagccggacg gttcctatcg ttccgaagaa ctgaacctgg tgatcatcgg cccgtctgct 480 

gatatcatcc agttcgagtg taagagcttt ggtcacgaag ttctgaacct cacccgtaac 540 

ggctacggtt ccactcagta catccgtttc tctccggact tcaccttcgg ttttgaagaa 600 

tccctggaag tagacacgaa cccactgctg ggcgctggta aattcgcaac tgatcctgcg 660 

gttaccctgg ctcacgaact gattcatgca ggccaccgcc tgtacggtat cgccatcaat 720 

ccgaaccgtg tcttcaaagt taacaccaac gcgtattacg agatgtccgg tctggaagtt 780 

agcttcgaag aactgcgtac ttttggcggt cacgacgcta aattcatcga ctctctgcaa 840 

gaaaacgagt tccgtctgta ctactataac aagttcaaag atatcgcatc caccctgaac 900 

aaagcgaaat ccatcgtggg taccactgct tctctccagt acatgaagaa cgtttttaaa 960 

gaaaaatacc tgctcagcga agacacctcc ggcaaattct ctgtagacaa gttgaaattc 1020 

gataaacttt acaaaatgct gactgaaatt tacaccgaag acaacttcgt taagttcttt 1080 

aaagttctga accgcaaaac ctatctgaac ttcgacaagg cagtattcaa aatcaacatc 1140 

gtgccgaaag ttaactacac tatctacgat ggtttcaacc tgcgtaacac caacctggct 1200 

gctaatttta acggccagaa cacggaaatc aacaacatga acttcacaaa actgaaaaac 1260 

ttcactggtc tgttcgagtt ttacaagctg ctgtgcgtcg acggcatcat tacctccaaa 1320 

actaaatctc tgatagaagg tagatttggc ggtttcacgg gcgcacgcaa atcagcgcgt 1380 

aaagcgctag cgggcggtgg cggtagcggc ggtggcggta gcggcggtgg cggtagcgca 1440 

ctagtgctgc agtgtatcaa ggttaacaac tgggatttat tcttcagccc gagtgaagac 1500 

aacttcacca acgacctgaa caaaggtgaa gaaatcacct cagatactaa catcgaagca 1560 

gccgaagaaa acatctcgct ggacctgatc cagcagtact acctgacctt taatttcgac 1620 

aacgagccgg aaaacatttc tatcgaaaac ctgagctctg atatcatcgg ccagctggaa 1680 

ctgatgccga acatcgaacg tttcccaaac ggtaaaaagt acgagctgga caaatatacc 1740 

atgttccact acctgcgcgc gcaggaattt gaacacggca aatcccgtat cgcactgact 1800 
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aactccgtta 


acgaagctct 


gctcaacccg 


tcccgtgtat 


acaccttctt 


ctctagcgac 


1860 


tacgtgaaaa 


aggtcaacaa 


agcgactgaa 


gctgcaatgt 


tcttgggttg 


ggttgaacag 


1920 


cttgtttatg 


attttaccga 


cgagacgtcc 


gaagtatcta 


ctaccgacaa 


aattgcggat 


1980 


atcactatca 


tcatcccgta 


catcggtccg 


gctctgaaca 


ttggcaacat 


gctgtacaaa 


2040 


gacgacttcg 


ttggcgcact 


gatcttctcc 


ggtgcggtga 


tcctgctgga 


gttcatcccg 


2100 


gaaatcgcca 


tcccggtact 


gggcaccttt 


gctctggttt 


cttacattgc 


aaacaaggtt 


2160 


ctgactgtac 


aaaccatcga 


caacgcgctg 


agcaaacgta 


acgaaaaatg 


ggatgaagtt 


2220 


tacaaatata 


tcgtgaccaa 


ctggctggct 


aaggttaata 


ctcagatcga 


cctcatccgc 


2280 


aaaaaaatga 


aagaagcact 


ggaaaaccag 


gcggaagcta 


ccaaggcaat 


cattaactac 


2340 


cagtacaacc 


agtacaccga 


ggaagaaaaa 


aacaacatca 


acttcaacat 


cgacgatctg 


2400 


tcctctaaac 


tgaacgaatc 


catcaacaaa 


gctatgatca 


acatcaacaa 


gttcctgaac 


2460 


cagtgctctg 


taagctatct 


gatgaactcc 


atgatcccgt 


acggtgttaa 


acgtctggag 


2520 


gacttcgatg 


cgtctctgaa 


agacgccctg 


ctgaaataca 


tttacgacaa 


ccgtggcact 


2580 


ctgatcggtc 


aggttgatcg 


tctgaaggac 


aaagtgaaca 


ataccttatc 


gaccgacatc 


2640 


ccttttcagc 


tcagtaaata 


tgtcgataac 


caacgccttt 


tgtccactct 


agactag 


2697 



<210> .62 

<211> 898 

<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Synthetic 

<400> 62 

Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
1,5. 10 15 

Asn Gly Val Asp lie Ala Tyr lie Lys lie Pro Asn Ala Gly Gin Met 
20 25 30 

Gin Pro Val Lys Ala Phe Lys He His Asn Lys He Trp Val He Pro 
35 40 45 

Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 

Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 .70 75 80 
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Sex Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg lie Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
ICQ 105 110 



He Val Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 
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Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu lie Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355... 360 - 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys lie Asn lie Val Pro Lys Val 
370 375 380 



Asn Tyr Thr lie Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu lie Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly lie lie Thr Ser Lys Thr Lys Ser Leu lie Glu Gly Arg 
435 440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Ala Leu Ala 
450 455 460 



Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala 
465 470 475 480 



Leu Val Leu Gin Cys He Lys Val Asn Asn Trp Asp Leu Phe Phe Ser 
485 490 495 



Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu He 
500 505 510 



Thr Ser Asp Thr Asn He Glu Ala Ala Glu Glu Asn He Ser Leu Asp 
515 520 525 



Leu He Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro Glu 
• 530 535 540 



Asn He Ser He Glu Asn Leu Ser Ser Asp He He Gly Gin Leu Glu 
545 550 555 560 



Leu Met Pro Asn He Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu Leu 



wo 2006/059105 



110 



PCT/GB2005/004598 



565 570 575 



Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu His 
580 585 590 



Gly Lys Ser Arg lie Ala Leu Thr Asn Ser Val Asn Glu Ala Leu Leu 
595 600 605 



Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys Lys 
610 615 620 



Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu Gin 
625 630 635 640 



Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr Asp 
645 650 655 



Lys lie Ala Asp lie Thr lie lie lie Pro Tyr lie Gly Pro Ala Leu 
660 665 670 



Asn He Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu He 
675 680 685 



Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro Glu He Ala He 
690 695 700 



Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He Ala Asn Lys Val 
705 710 • 715 720 



Leu Thr Val Gin Thr He Asp Asn Ala Leu Ser Lys Arg Asn Glu Lys 
725 730 735 



Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp Leu Ala Lys Val 
740 745 750 



Asn Thr Gin He Asp Leu He Arg Lys Lys Met Lys Glu Ala Leu Glu 
755 760 765 



Asn Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr Gin Tyr Asn Gin 
770 775 • 780 



Tyr Thr Glu Glu Glu Lys Asn Asn He Asn Phe Asn He Asp Asp Leu 
785 790 795 800 



Ser Ser Lys Leu Asn Glu. Ser He Asn Lys Ala Met He Asn He Asn 
805 810 815 
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111 



Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu 
820 825 



Met Asn Ser Met lie 
830 



Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp 
835 840 



Ala Ser Leu Lys Asp 
845 



Ala Leu Leu Lys Tyr lie Tyr Asp Asn Arg Gly 
850 855 



Thr Leu lie Gly Gin 
860 



Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr 
865 870 875 



Leu Ser Thr Asp lie 
880 



Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin 
885 890 



Arg Leu Leu Ser Thr 
895 



Leu Asp 



<210> 63 

<211> 2736 

<2I2> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 63 



ctcgggattg agggtcgttt 


tggcggtttc 


acgggcgcac 


gcaaatcagc 


gcgtaaatta 


60 


gctaaccaga ctagtggcgg 


tgggggtagt 


ggcggtggcg 


gttcgggcgg 


gggtgggagc 


120 


cctaggggat ccatggagtt 


cgttaacaaa 


cagttcaact 


ataaagaccc 


agttaacggt 


180 


gttgacattg cttacatcaa 


aatcccgaac 


gctggccaga 


tgcagccggt 


aaaggcattc 


240 


aaaatccaca acaaaatctg 


ggttatcccg 


gaacgtgata 


cctttactaa 


cccggaagaa 


300 


ggtgacctga acccgccacc 


ggaagcgaaa 


caggtgccgg 


tatcttacta 


tgactccacc 


360 


tacctgtcta ccgataacga 


aaaggacaac 


tacctgaaag 


gtgttactaa 


actgttcgag 


420 


cgtatttact ccaccgacct 


gggccgtatg 


ctgctgacta 


gcatcgttcg 


cggtatcccg 


480 


ttctggggcg gttctaccat 


cgataccgaa 


ctgaaagtaa 


tcgacactaa 


ctgcatcaac 


540 


gttattcagc cggacggttc 


ctatcgttcc 


gaagaactga 


acctggtgat 


catcggcccg 


600 


tctgctgata tcatccagtt 


cgagtgtaag 


agctttggtc 


acgaagttct 


gaacctcacc 


660 


cgtaacggct acggttccac 


tcagtacatc 


cgtttctctc 


cggacttcac 


cttcggtttt 


720 


: gaagaatccc -tggaagt aga 


-cacgaacccar ctgctgggcg- 


ctggtaaatt 


cgcaactgat 


780 
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cctgcggtta ccctggctca cgaactgatt catgcaggcc accgcctgta cggtatcgcc 840 

atcaatccga accgtgtctt caaagttaac accaacgcgt attacgagat gtccggtctg 900 

gaagttagct tcgaagaact gcgtactttt ggcggtcacg acgctaaatt catcgactct 960 

ctgcaagaaa acgagttccg tctgtactac tataacaagt tcaaagatat cgcatccacc 1020 

ctgaacaaag cgaaatccat cgtgggtacc actgcttctc tccagtacat gaagaacgtt 1080 

tttaaagaaa aatacctgct cagcgaagac acctccggca aattctctgt agacaagttg 1140 

aaattcgata aactttacaa aatgctgact gaaatttaca ccgaagacaa cttcgttaag 1200 

ttctttaaag ttctgaaccg caaaacctat ctgaacttcg acaaggcagt attcaaaatc 1260 

aacatcgtgc cgaaagttaa ctacactatc tacgatggtt tcaacctgcg taacaccaac 1320 

ctggctgcta attttaacgg ccagaacacg gaaatcaaca acatgaactt cacaaaactg 1380 

aaaaacttca ctggtctgtt cgagttttac aagctgctgt gcgtcgacgg catcattacc 1440 

tccaaaacta aatctctgat agaaggtaga aacaaagcgc tgaacctgca gtgtatcaag 1500 

gttaacaact gggatttatt cttcagcccg agtgaagaca acttcaccaa cgacctgaac 1560 

aaaggtgaag aaatcacctc agatactaac atcgaagcag ccgaagaaaa catctcgctg 1620 

gacctgatcc agcagtacta cctgaccttt aatttcgaca acgagccgga aaacatttct 1680 

atcgaaaacc tgagctctga tatcatcggc cagctggaac tgatgccgaa catcgaacgt 1740 

ttcccaaacg gtaaaaagta cgagctggac aaatatacca tgttccacta cctgcgcgcg 1800 

caggaatttg aacacggcaa atcccgtatc gcactgacta actccgttaa cgaagctctg 1860 

ctcaacccgt cccgtgtata caccttcttc tctagcgact acgtgaaaaa ggtcaacaaa 1920 

gcgactgaag ctgcaatgtt cttgggttgg gttgaacagc ttgtttatga ttttaccgac 1980 

gagacgtccg aagtatctac taccgacaaa attgcggata tcactatcat catcccgtac 2040 

atcggtccgg ctetgaacat tggcaacatg ctgtacaaag acgacttcgt tggcgcactg 2100 

atcttctccg gtgcggtgat cctgctggag ttcatcccgg aaatcgccat cccggtactg 2160 

ggcacctttg ctctggtttc ttacattgca aacaaggttc tgactgtaca aaccatcgac 2220 

aacgcgctga gcaaacgtaa cgaaaaatgg gatgaagttt acaaatatat cgtgaccaac 2280 

tggctggcta aggttaatac tcagatcgac ctcatccgca aaaaaatgaa agaagcactg 2340 

gaaaaccagg cggaagctac caaggcaatc attaactacc agtacaacca gtacaccgag 2400 

gaagaaaaaa acaacatcaa cttcaacatc gacgatctgt cctctaaact gaacgaatcc 2460 

atcaacaaag ctatgatcaa catcaacaag ttcctgaacc agtgctctgt aagctatctg 2520 

atgaactcca tgatcccgta cggtgttaaa cgtctggagg acttcgatgc gtctctgaaa 2580 

gacgccctgc tgaaatacat ttacgacaac cgtggcactc tgatcggtca ggttgatcgt 2640 
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ctgaaggaca aagtgaacaa taccttatcg accgacatcc cttttcagct cagtaaatat 2700 
gtcgataacc aacgcctttt gtccactcta gactag 2736 



<210> 


64 


<211> 


911 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


64 



Leu Gly lie Glu Gly Arg Phe Gly Qly Phe Thr Gly Ala Arg Lys Ser 
15 10 15 



Ala Arg Lys Leu Ala Asn Gin Thr Ser Gly Gly Gly Gly Ser Gly Gly 
20 25 30 



Gly Gly Ser Gly Gly Gly Gly Ser Pro Arg Gly Ser Met Glu Phe Val 
35 40 45 



Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val Asn Gly Val Asp lie Ala 
50 55 60 



Tyr lie Lys lie Pro Asn Ala Gly Gin Met Gin Pro Val Lys Ala Phe 
65 70 75 80 



Lys lie His Asn Lys lie Trp Val lie Pro Glu Arg Asp Thr Phe Thr 
85 90 95 



Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro Pro Glu Ala Lys Gin Val 
100 105 110 



Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu Ser Thr Asp Asn Glu Lys 
115 120 125 



Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu Phe Glu Arg lie Tyr Ser 
130 135 140 



Thr Asp Leu Gly Arg Met Leu Leu Thr Ser lie Val Arg Gly He Pro 
145 150 155 160 



Phe Trp Gly Gly Ser Thr He Asp Thr Glu Leu Lys Val He Asp Thr 
165 170 175 

Asn Cys He Asn Val He Gin Pro Asp Gly Ser Tyr Arg Ser Glu Glu 
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180 185 190 



Leu Asn Leu Val lie lie Gly Pro Ser Ala Asp He He Gin Phe Glu 
195 200 205 



Cys Lys Ser Phe Gly His Glu Val Leu Asn Leu Thr Arg Asn Gly Tyr 
210 215 220 



Gly Ser Thr Gin Tyr He Arg Phe Ser Pro Asp Phe Thr Phe Gly Phe 
225 230 235 240 



Glu Glu Ser Leu Glu Val Asp Thr Asn Pro Leu Leu Gly Ala Gly Lys 
245 250 255 



Phe Ala Thr Asp Pro Ala Val Thr Leu Ala His Glu Leu He His Ala 
260 265 270 



Gly His Arg Leu Tyr Gly He Ala He Asn Pro Asn Arg Val Phe Lys 
275 280 285 



Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser Gly Leu Glu Val Ser Phe 
290 295 300 



Glu Glu Leu Arg Thr Phe Gly Gly His Asp Ala Lys Phe He Asp Ser 
305 310 315 320 



Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr Tyr Asn Lys Phe Lys Asp 
325 330 335 



He Ala Ser Thr Leu Asn Lys Ala Lys Ser He Val Gly Thr Thr Ala 
340 345 350 



Ser Leu Gin Tyr Met Lys Asn Val Phe Lys Glu Lys Tyr Leu Leu Ser 
355 .360 365 



Glu Asp Thr Ser Gly Lys Phe Ser Val Asp Lys Leu Lys Phe Asp Lys 
370 375 380 



Leu Tyr Lys Met Leu Thr Glu He Tyr Thr Glu Asp Asn Phe Val Lys 
385 390 395 400 



Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr Leu Asn Phe Asp Lys Ala 
405 410 415 



Val Phe Lys He Asn -,He Val ^Pro Lys Val Asn Tyr Thr He Tyr Asp 
420 425 430 
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Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala Ala Asn Phe Asn Gly Gin 
435 440 445 



Asn Thr Glu lie Asn Asn Met Asn Phe Thr Lys Leu Lys Asn Phe Thr 
450 455 460 



Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys Val Asp Gly lie lie Thr 
465 470 475 480 



Ser Lys Thr Lys Ser Leu lie Glu Gly Arg Asn Lys Ala Leu Asn Leu 
485 490 495 



Gin Cys lie Lys Val Asn Asn Trp Asp Leu Phe Phe Ser Pro Ser Glu 
500 505 510 



Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu lie Thr Ser Asp 
515 520 525 



Thr Asn lie Glu Ala Ala Glu Glu Asn lie Ser Leu Asp Leu He Gin 
530 535 540 



Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro Glu Asn He Ser 
545 550 555 560 



He Glu Asn Leu Ser Ser Asp life He Gly Gin Leu Glu Leu Met Pro 
565 570 575 



Asn He Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu Leu Asp Lys Tyr 
580 585 590 



Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu His Gly Lys Ser 
595 600 605 



Arg He Ala Leu- Thr Asn Ser Val Asn Glu Ala Leu Leu Asn Pro Ser 
610 615 620 



Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys Lys Val Asn Lys 
625 630 635 640 



Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu Gin Leu Val Tyr 
645 650 655 



Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr Asp Lys He Ala 

.:.=^ w: ... 660 .o " *. 665 - ^ . • 670 
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Asp lie Thr lie lie He Pro Tyr He Gly Pro Ala Leu Asn He Gly 
675 680 685 



Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu He Phe Ser Gly 
690 695 700 



Ala Val He Leu Leu Glu Phe He Pro Glu He Ala He Pro Val Leu 
705 710 715 720 



Gly Thr Phe Ala Leu Val Ser Tyr He Ala Asn Lys Val Leu Thr Val 
725 730 735 



Gin Thr He Asp Asn Ala Leu Ser Lys Arg Asn Glu Lys Trp Asp Glu 
740 745 750 



Val Tyr Lys Tyr He Val Thr Asn Trp Leu Ala Lys Val Asn Thr Gin 
755 760 765 



He Asp Leu He Arg Lys Lys Met Lys Glu Ala Leu Glu Asn Gin Ala 
770 775 780 



Glu Ala Thr Lys Ala He He Asn Tyr Gin Tyr Asn Gin Tyr Thr Glu 
785 790 795 800 



Glu Glu Lys Asn Asn He Asn Phe Asn He Asp Asp Leu Ser Ser Lys 
805 810 815 



Leu Asn Glu Ser He Asn Lys Ala Met He Asn He Asn Lys Phe Leu 
820 825 830 



Asn Gin Cys Ser Val Ser Tyr Leu Met Asn Ser Met He Pro Tyr Gly 
835 840 845 



Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys Asp Ala Leu Leu 
850 855 860 



Lys Tyr He Tyr Asp Asn Arg Gly Thr Leu He Gly Gin Val Asp Arg 
865 870 875 880 



Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp He Pro Phe Gin 
885 890 895 



Leu. Ser Lys Tyr Val Asp Asn Gin Arg Leu Leu Ser Thr Leu Asp 
900 905 910 
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<210> 65 
<211> 177 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 65 

ggatccacgc acgtcgacgg catcattacc tccaaaacta aatctctgat agaaggtaga 60 
tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaat tagctaacca ggcgctagcg 120 
ggtggtggtg gttctgcact agtgctgcag acgcacggtc tagaatgata aaagctt 177 

<210> 66 
<211> 192 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 66 

ggatccacgc acgtcgacgg catcattacc tccaaaacta aatctctgat agaaggtaga 60 
tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaat tagctaacca ggcgctagcg 120 
ggtggtggtg gttctggtgg tggtggttct gcactagtgc tgcagacgca cggtctagaa 180 
tgataaaagc tt 192 

<210> 67 
<211> 222 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Synthetic 
<400> 67 

ggatccacgc acgtcgacgg catcattacc tccaaaacta aatctctgat agaaggtaga 60 
tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaat tagctaacca ggcgctagcg 120 
ggtggtggtg gttctggtgg tggtggttct ggtggtggtg gttctggtgg tggtggttct 180 
gcactagtgc tgcagacgca cggtctagaa tgataaaagc tt 222 



<210> 68 

<211> 237 

<212> DNA 

<213> Artificial Sequence 
<220:> 

<223> Synthetic 

<400> 68 
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ggatccacgc acgtcgacgg catcattacc tccaaaacta aatctctgat agaaggtaga 60 

tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaat tagctaacca ggcgctagcg 120 

ggtggtggtg gttctggtgg tggtggttct ggtggtggtg gttctggtgg tggtggttct 180 

ggtggtggtg gttctgcact agtgctgcag acgcacggtc tagaatgata aaagctt 237 



<210> .,69 

<211> 228 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 69 



ggatccacgc 


acgtcgacgg 


catcattacc tccaaaacta aatctctgat agaaggtaga 


60 


tttggcggtt 


tcacgggcgc 


acgcaaatca gcgcgtaaat tagctaacca ggcgctagcg 


120 


gctgaagctg 


ctgctaaaga 


agctgctgct aaagaagctg ctgctaaagc tggtggcggt 


180 


ggttccgcac tagtgctgca gacgcacggt ctagaatgat aaaagctt 


228 


<210> 70 
<211> 2694 
<212> DNA 

<213> Artificial Sequence 




<220> 

<223> Synthetic 






<400>. 70 
ggatccatgg 


agttcgttaa 


caaacagttc aactataaag acccagttaa cggtgttgac 


60 


attgcttaca 


tcaaaatccc 


gaacgctggc cagatgcagc cggtaaaggc attcaaaatc 


120 


cacaacaaaa 


tctgggttat 


cccggaacgt gataccttta ctaacccgga agaaggtgac 


180 


ctgaacccgc 


caccggaagc 


gaaacaggtg ccggtatctt actatgactc cacctacctg 


240 


tctaccgata 


acgaaaagga 


caactacctg aaaggtgtta ctaaactgtt cgagcgtatt 


300 


tactccaccg 


acctgggccg 


tatgctgctg actagcatcg ttcgcggtat cccgttctgg 


360 


ggcggttcta 


ccatcgatac 


cgaactgaaa gtaatcgaca ctaactgcat caacgttatt 


420 


cagccggacg 


gttcctatcg 


ttccgaagaa ctgaacctgg tgatcatcgg cccgtctgct 


480 


gatatcatcc 


agttcgagtg 


taagagcttt ggtcacgaag ttctgaacct cacccgtaac 


540 


ggctacggtt 


ccactcagta 


catccgtttc tctccggact tcaccttcgg ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


cccactgctg ggcgctggta aattcgcaac tgatcctgcg 


660 


gttaccctgg 


ctcacgaact 


gattcatgca ggccaccgcc tgtacggtat cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt 


taacaccaac gcgtattacg agatgtccgg tctggaagtt 


780 
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agcttcgaag 


aactgcgtac 


ttttggcggt 


cacgacgcta 


aattcatcga 


ctctctgcaa 


840 


gaaaacgagt 


tccgtctgta 


ctactataac 


aagttcaaag 


atatcgcatc 


caccctgaac 


900 


aaagcgaaat 


ccatcgtggg 


taccactgct 


tctctccagt 


acatgaagaa 


cgtttttaaa 


960 


gaaaaatacc 


tgctcagcga 


agacacctcc 


ggcaaattct 


ctgtagacaa 


gttgaaattc 


1020 


gataaacttt 


acaaaatgct 


gactgaaatt 


tacaccgaag 


acaacttcgt 


taagttcttt 


1080 


aaagttctga 


accgcaaaac 


ctatctgaac 


ttcgacaagg 


cagtattcaa 


aatcaacatc 


1140 


gtgccgaaag 


ttaactacac 


tatctacgat 


ggtttcaacc 


tgcgtaacac 


caacctggct 


1200 


gctaatttta 


acggccagaa 


cacggaaatc 


aacaacatga 


acttcacaaa 


actgaaaaac 


1260 


ttcactggtc 


tgttcgagtt 


ttacaagctg 


ctgtgcgtcg 


acggcatcat 


tacctccaaa 


1320 


actaaatctc 


tgatagaagg 


tagatttggc 


ggtttcacgg 


gcgcacgcaa 


atcagcgcgt 


1380 


aaattagcta 


accaggcgct 


agcgggtggt 


ggtggttctg 


gtggtggtgg 


ttctgcacta 


1440 


gtgctgcagt 


gtatcaaggt 


taacaactgg 


gatttattct 


tcagcccgag 


tgaagacaac 


1500 


ttcaccaacg 


acctgaacaa 


aggtgaagaa 


atcacctcag 


atactaacat 


cgaagcagcc 


1560 


gaagaaaaca 


tctcgctgga 


cctgatccag 


cagtactacc 


tgacctttaa 


tttcgacaac 


1620 


gagccggaaa 


acatttctat 


cgaaaacctg 


agctctgata 


tcatcggcca 


gctggaactg 


1680 


atgccgaaca 


tcgaacgttt 


cccaaacggt 


aaaaagtacg 


agctggacaa 


atataccatg 


1740 


ttccactacc 


tgcgcgcgca 


ggaatttgaa 


cacggcaaat 


cccgtatcgc 


actgactaac 


1800 


tccgttaacg 


aagctctgct 


caacccgtcc 


cgtgtataca 


ccttcttctc 


tagcgactac 


1860 


gtgaaaaagg 


tcaacaaagc 


gactgaagct 


gcaatgttct 


tgggttgggt 


tgaacagctt 


1920 


gtttatgatt 


ttaccgacga 


gacgtccgaa 


gtatctacta 


ccgacaaaat 


tgcggatatc 


1980 


actatcatca 


tcccgtacat 


cggtccggct 


ctgaacattg 


gcaacatgct 


gtacaaagac 


2040 


gacttcgttg 


gcgcactgat 


cttctccggt 


gcggtgatcc 


tgctggagtt 


catcccggaa 


2100 


atcgccatcc 


cggtactggg 


cacctttgct 


ctggtttctt 


acattgcaaa 


caaggttctg 


2160 


actgtacaaa 


ccatcgacaa 


cgcgctgagc 


aaacgtaacg 


aaaaatggga 


tgaagtttac 


2220 


aaatatatcg 


tgaccaactg 


gctggctaag 


gttaatactc 


agatcgacct 


catccgcaaa 


2280 


aaaatgaaag 


aagcactgga 


aaaccaggcg 


gaagctacca 


aggcaatcat 


taactaccag 


2340 


tacaaccagt 


acaccgagga 


agaaaaaaac 


aacatcaact 


tcaacatcga 


cgatctgtcc 


2400 


tctaaactga 


acgaatccat 


caacaaagct 


atgatcaaca 


tcaacaagtt 


cctgaaccag 


2460 


tgctctgtaa 


gctatctgat 


gaactccatg 


atcccgtacg 


gtgttaaacg 


tctggaggac 


2520 


ttcgatgcgt 


ctctgaaag?. 


cgccctgctg 


aaatacattt 


acgacaaccg 


tggcactctg 


2580 
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atcggtcagg ttgatcgtct gaaggacaaa gtgaacaata ccttatcgac cgacatccct 2640 
tttcagctca gtaaatatgt cgataaccaa cgccttttgt ccactctaga ctag 2694 



<210> 


71 


<211> 


897 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


71 



Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
15 10 15 



Asn Gly Val Asp lie Ala Tyr lie Lys lie Pro Asn Ala Gly Gin Met 
20 25 30 



Gin Pro Val Lys Ala Phe Lys lie His Asn Lys lie Trp Val lie Pro 
35 40 45 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg He Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



He Val Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
.130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



:iieUfTte' ArgvAss, QlyiTyrr>ai?-yf;.Ser -Thr.Qln,.-Tyr He JVrg Phe JSer Pro 
180 185 190 
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Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 

Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 

His Glu Leu lie His Ala Gly His Arg Leu Tyr Gly lie Ala He Asn 
225 230 235 240 

Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 

Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 

Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 

Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 

He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 

Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 

Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu He Tyr Thr 
340 345 350 

Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 

Leu Asn Phe Asp Lys Ala Val Phe Lys He Asn He Val Pro Lys Val 
370 375 380 

Asn Tyr Thr He Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 

Ala Asn Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr 
405 410 415 

Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
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Val Asp Gly He He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Leu Ala Asn 
450 455 460 



Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu 
465 470 475 480 



Val Leu Gin Cys He Lys Val Asn Asn Trp Asp Leu Phe Phe Ser Pro 
485 490 495 



Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu He Thr 
500 505 510 



Ser Asp Thr Asn He Glu Ala Ala Glu Glu Asn He Ser Leu Asp Leu 
515 520 525 



He Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro Glu Asn 
530 535 540 



He Ser He Glu Asn Leu Ser Ser Asp He He Gly Gin Leu Glu Leu 
545 550 555 560 



Met Pro Asn He Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu Leu Asp 
565 570 575 



Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu His Gly 
580 585 590 



Lys Ser Arg He Ala Leu Thr Asn Ser Val Asn Glu Ala Leu Leu Asn 
595 600 605 



Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys Lys Val 
610 615 620 



Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu Gin Leu 
625 630 635 640 



Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr Asp Lys 
645 650 655 



He Ala Asp He Thr He He He Pro Tyr He Gly Pro Ala Leu Asn 
660 665 670 
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lie Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu lie Phe 
675 680 685 



Ser Gly Ala Val lie Leu Leu Glu Phe lie Pro Glu lie Ala He Pro 
690 695 700 



Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He Ala Asn Lys Val Leu 
705 710 715 720 



Thr Val Gin Thr He Asp Asn Ala Leu Ser Lys Arg Asn Glu Lys Trp 
725 730 735 



Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp Leu Ala Lys Val Asn 
740 745 750 



Thr Gin He Asp Leu He Arg Lys Lys Met Lys Glu Ala Leu Glu Asn 
755 760 765 



Gin Ala Glu Ala Thr Lys Ala He He Asn' Tyr Gin Tyr Asn Gin Tyr 
770 775 780 



Thr Glu Glu Glu Lys Asn Asn He Asn Phe Asn He Asp Asp Leu Ser 
785 790 795 800 



Ser Lys Leu Asn Glu Ser He Asn Lys Ala Met He Asn He Asn Lys 
805 810 815 



Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met Asn Ser Met He Pro 
820 825 830 



Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys Asp Ala 
835 840 845 



Leu Leu Lys Tyr He Tyr Asp Asn Arg Gly Thr Leu He Gly Gin Val 
850 855 860 



Asp. Arg Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp He Pro 
865 870 875 880 



Phe Gin Leu Ser Lys . Tyr Val Asp Asn Gin Arg Leu Leu Ser Thr Leu 
885 890 895 



Asp 



<210> 72 
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<211> 2724 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 72 



ggatccatgg 


agttcgttaa 


caaacagttc 


aactataaag 


acccagttaa cggtgttgac 


60 


attgcttaca 


tcaaaatccc 


gaacgctggc 


cagatgcagc 


cggtaaaggc attcaaaatc 


120 


cacaacaaaa 


tctgggttat 


cccggaacgt 


gataccttta 


ctaacccgga agaaggtgac 


180 


ctgaacccgc 


caccggaagc 


gaaacaggtg 


ccggtatctt 


actatgactc cacctacctg 


240 


tctaccgata 


acgaaaagga 


caactacctg 


aaaggtgtta 


ctaaactgtt cgagcgtatt 


300 


tactccaccg 


acctgggccg 


tatgctgctg 


actagcatcg 


ttcgcggtat cccgttctgg 


360 


ggcggttcta 


ccatcgatac 


cgaactgaaa 


gtaatcgaca 


ctaactgcat caacgttatt 


420 


cagccggacg 


gttcctatcg 


ttccgaagaa 


ctgaacctgg 


tgatcatcgg cccgtctgct 


480 


gatatcatcc 


agttcgagtg 


taagagcttt 


ggtcacgaag 


ttctgaacct cacccgtaac 


540 


ggctacggtt 


ccactcagta 


catccgtttc 


tctccggact 


tcaccttcgg ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


cccactgctg 


ggcgctggta 


aattcgcaac tgatcctgcg 


660 


gttaccctgg 


ctcacgaact 


gattcatgca 


ggccaccgcc 


tgtacggtat cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt 


taacaccaac 


gcgtattacg 


agatgtccgg tctggaagtt 


780 


agcttcgaag 


aactgcgtac 


ttttggcggt 


cacgacgcta 


aattcatcga ctctctgcaa 


840 


gaaaacgagt 


tccgtctgta 


ctactataac 


aagttcaaag 


atatcgcatc caccctgaac 


900 


aaagcgaaat 


ccatcgtggg 


taccactgct 


tctctccagt 


acatgaagaa cgtttttaaa 


960 


gaaaaatacc 


tgctcagcga 


agacacctcc 


ggcaaattct 


ctgtagacaa gttgaaattc 


1020 


gataaacttt 


acaaaatgct 


gactgaaatt 


tacaccgaag 


acaacttcgt taagttcttt 


1080 


aaagttctga 


accgcaaaac 


ctatctgaac 


ttcgacaagg 


cagtattcaa aatcaacatc 


1140 


gtgccgaaag 


ttaactacac 


tatctacgat 


ggtttcaacc 


tgcgtaacac caacctggct 


1200 


gctaatttta 


acggccagaa 


cacggaaatc 


aacaacatga 


acttcacaaa actgaaaaac 


1260 


ttcactggtc 


tgttcgagtt 


ttacaagctg 


ctgtgcgtcg 


acggcatcat tacctccaaa 


1320 


actaaatctc 


tgatagaagg 


tagatttggc 


ggtttcacgg 


gcgcacgcaa atcagcgcgt 


1380 


aaattagcta 


accaggcgct 


agcgggtggt 


ggtggttctg 


gtggtggtgg ttctggtggt 


1440 


ggtggttctg 


gtggtggtgg 


ttctgcacta 


gtgctgcagt 


gtatcaaggt taacaactgg 


1500 


gatttattct 


tcagcccgag 


tgaagacaac 


ttcaccaacg 


acctgaacaa aggtgaagaa 


1560 


atcacctcag 


atactaacat 


cgaagcagcc 


gaagaaaaca 


tctcgctgga cctgatccag 


1620 
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tgacctutaa 


ttitcgacaac gagccggaaa acatttctat cgaaaacctg 


1 f O A 




tcatcggcca 


gctggaactg atgccgaaca tcgaacgttt cccaaacggt 


1 'I >! A 
1 /40 


aaaaacftacg 


agctggacaa 


atataccatg ttccactacc tgcgcgcgca ggaatttgaa 


1 O A A 

XoUO 




cccgtatcgc 


actgactaac tccgttaacg aagctctgct caacccgtcc 


1860 


cgtgtataca 


ccttcttctc 


tagcgactac gtgaaaaagg tcaacaaagc gactgaagct 


1920 


gcaatgttct 


tgggttgggt 


tgaacagctt gtttatgatt ttaccgacga gacgtccgaa 


1980 


gtatctacta 


ccgacaaaat 


tgcggatatc actatcatca tcccgtacat cggtccggct 


2040 


ctgaacattg 


gcaacatgct 


gtacaaagac gacttcgttg gcgcactgat cttctccggt 


2100 


gcggtgatcc 


tgctggagtt 


catcccggaa atcgccatcc cggtactggg cacctttgct 


2160 


ctggtttctt 


acattgcaaa 


caaggttctg actgtacaaa ccatcgacaa cgcgctgagc 


2220 


aaacgtaacg 


aaaaatggga 


tgaagtttac aaatatatcg tgaccaactg gctggctaag 


2280 


gttaatactc 


agatcgacct 


catccgcaaa aaaatgaaag aagcactgga aaaccaggcg 


2340 


gaagctacca 


aggcaatcat 


taactaccag tacaaccagt acaccgagga agaaaaaaac 


2400 


aacatcaact 


tcaacatcga 


cgatctgtcc tctaaactga acgaatccat caacaaagct 


2460 


atgatcaaca 


tcaacaagtt 


cctgaaccag tgctctgtaa gctatctgat gaactccatg 


2520 


atcccgtacg 


gtgttaaacg 


tctggaggac ttcgatgcgt ctctgaaaga cgccctgctg 


2580 


aaatacattt 


acgacaaccg 


tggcactctg atcggtcagg ttgatcgtct gaaggacaaa 


2640 


gtgaacaata 


ccttatcgac 


cgacatccct tttcagctca gtaaatatgt cgataaccaa 


2700 


cgccttttgt 


ccactctaga 


ctag 


2724 



<210> 73 

<211> 907 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 73 

Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
1 5 " 10 15 

Asn Gly Val Asp lie Ala Tyr lie Lys lie Pro Asn Ala Gly Gin Met 
20 25 30 



Gin Pro Val Lys Ala Phe Lys lie His Asn Lys lie Trp Val lie Pro 
35 40 45 
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Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg He Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



He Val Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 • 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 
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Tyr Asn Lys Phe Lys Asp lie Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



lie Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 

325 - . - 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu lie Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys lie Asn lie Val Pro Lys Val 
370 375 380 



Asn Tyr Thr lie Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu lie Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly He He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Leu Ala Asn 
450 455 460 



Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
465 470 475 480 



Gly. Gly Ser Gly Gly Gly Gly Ser Ala Leu Val Leu Gin Cys He Lys 
485 490 495 



Val Asn Asn Trp Asp Leu Phe Phe Ser Pro Ser Glu Asp Asn Phe Thr 
500 505 510 



Asn Asp Leu Asn Lys Gly Glu Glu He Thr Ser Asp Thr Asn He Glu 
515 520 525 



Ala Ala Glu Glu Asn He Ser Leu Asp Leu He Gin Gin Tyr Tyr Leu 
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530 535 540 



Thr Phe Asn Phe Asp Asn Glu Pro Glu Asn He Ser He Glu Asn Leu 
545 550 555 560 



Ser Ser Asp He He Gly Gin Leu Glu Leu Met Pro Asn He Glu Arg 
565 570 575 



Phe Pro Asn Gly Lys Lys Tyr Glu Leu Asp Lys Tyr Thr Met Phe His 
580 585 590 



Tyr Leu Arg Ala Gin Glu Phe Glu His Gly Lys Ser Arg He Ala Leu 
595 600 605 



Thr Asn Ser Val Asn Glu Ala Leu Leu Asn Pro Ser Arg Val Tyr Thr 
610 615 620 



Phe Phe Ser Ser Asp Tyr Val Lys Lys Val Asn Lys Ala Thr Glu Ala 
625 630 635 640 



Ala Met Phe Leu Gly Trp Val Glu Gin Leu Val Tyr Asp Phe Thr Asp 
645 650 655 



Glu Thr Ser Glu Val Ser Thr Thr Asp Lys He Ala Asp He Thr He 
660 665 670 



He He Pro Tyr He Gly Pro Ala Leu Asn He Gly Asn Met Leu Tyr 
675 680 685 



Lys Asp Asp Phe Val Gly Ala Leu He Phe Ser Gly Ala Val He Leu 
690 695 700 



Leu Glu Phe He Pro Glu He Ala He Pro Val Leu Gly Thr Phe Ala 
705 710 715 720 



Leu Val Ser Tyr He Ala Asn Lys Val Leu Thr Val Gin Thr He Asp 
725 730 735 



Asn Ala Leu Ser Lys Arg Asn Glu Lys Trp Asp Glu Val Tyr Lys Tyr 
740 745 750 



He Val Thr Asn Trp Leu Ala Lys Val Asn Thr Gin He Asp Leu He 
755 760 765 



Arg Lys Lys Met Lys Glu Ala Leu Glu Asn Gin Ala Gin Ala Thr Lys 
770 775 780 
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Ala lie lie Asn Tyr Gin Tyr Asn Gin Tyr Thr Glu Glu Glu Lys Asn 
785 790 795 800 



Asn He Asn Phe Asn He Asp Asp Leu Ser Ser Lys Leu Asn Glu Ser 
805 610 815 



lie Asn Lys Ala Met He Asn lie Asn Lys Phe Leu Asn Gin Cys Ser 
820 825 830 



Val Ser Tyr Leu Met Asn Ser Met He Pro Tyr Gly Val Lys Arg Leu 
835 840 845 



Glu Asp Phe Asp Ala Ser Leu Lys Asp Ala Leu Leu Lys Tyr He Tyr 
850 855 860 



Asp Asn Arg Gly Thr Leu He Gly Gin Val Asp Arg Leu Lys Asp Lys 
865 870 875 880 



Val Asn Asn Thr Leu Ser Thr Asp He Pro Phe Gin Leu Ser Lys Tyr 
885 890 895 



Val Asp Asn Gin Arg Leu Leu Ser Thr Leu Asp 
900 905 



<210> 74 
<211> 207 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 74 

ggatccacgc acgtcgacgg catcattacc tccaaaacta aatctgacga tgacgataaa 60 
tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaac gtaagaacca ggcgctagcg 120 
ggcggtggcg gtagcggcgg tggcggtagc ggcggtggcg gtagcgcact agtgctgcag 180 
acgcacggtc tagaatgata aaagctt 207 



<210> 75 

<211> 2709 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 75 
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ggatccatgg 


agttcgttaa caaacagttc 


aactataaag 


acccagttaa 


cggtgttgac 


60 


attgcttaca 


tcaaaatccc gaacgctggc 


cagatgcagc 


cggtaaaggc 


attcaaaatc 


120 


cacaacaaaa 


tctgggttat cccggaacgt 


gataccttta 


ctaacccgga 


agaaggtgac 


180 


ctgaacccgc 


caccggaagc gaaacaggtg 


ccggtatctt 


actatgactc 


cacctacctg 


240 


tctaccgata 


acgaaaagga caactacctg 


aaaggtgtta 


ctaaactgtt 


cgagcgtatt 


300 


tactccaccg 


acctgggccg tatgctgctg 


actagcatcg 


ttcgcggtat 


cccgttctgg 


360 


gscggttcta 


ccatcgatac cgaactgaaa 


gtaatcgaca 


ctaactgcat 


caacgttatt 


420 


cagccggacg 


gttcctatcg ttccgaagaa 


ctgaacctgg 


tgatcatcgg 


cccgtctgct 


480 


gatatcatcc 


agttcgagtg taagagcttt 


ggtcacgaag 


ttctgaacct 


cacccgtaac 


540 


ggctacggtt 


ccactcagta catccgtttc 


tctccggact 


tcaccttcgg 


ttttgaagaa 


600 


tccctggaag 


tagacacgaa cccactgctg 


ggcgctggta 


aattcgcaac 


tgatcctgcg 


660 


gttaccctgg 


ctcacgaact gattcatgca 


ggccaccgcc 


tgtacggtat 


cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt taacaccaac 


gcgtattacg 


agatgtccgg 


tctggaagtt 


780 


agcttcgaag 


aactgcgtac ttttggcggt 


cacgacgcta 


aattcatcga 


ctctctgcaa 


840 


gaaaacgagt 


tccgtctgta ctactataac 


aagttcaaag 


atatcgcatc 


caccctgaac 


900 


aaagcgaaat 


ccatcgtggg taccactgct 


tctctccagt 


acatgaagaa 


cgtttttaaa 


960 


gaaaaatacc 


tgctcagcga agacacctcc 


ggcaaattct 


ctgtagacaa 


gttgaaattc 


1020 


gataaacttt 


acaaaatgct gactgaaatt 


tacaccgaag 


acaacttcgt 


taagttcttt 


1080 


aaagttctga 


accgcaaaac ctatctgaac 


ttcgacaagg 


cagtattcaa 


aatcaacatc 


1140 


gtgccgaaag 


ttaactacac tatctacgat 


ggtttcaacc 


tgcgtaacac 


caacctggct 


1200 


gctaatttta 


acggccagaa cacggaaatc 


aacaacatga 


acttcacaaa 


actgaaaaac 


1260 


ttcactggtc 


tgttcgagtt ttacaagctg 


ctgtgcgtcg 


acggcatcat 


tacctccaaa 


1320 


actaaatctg 


acgatgacga taaatttggc 


•ggtttcacgg 


gcgcacgcaa 


atcagcgcgt 


1380 


aaacgtaaga 


accaggcgct agcgggcggt 


ggcggtagcg 


gcggtggcgg 


tagcggcggt 


1440 


ggcggtagcg 


cactagtgct gcagtgtatc 


aaggttaaca 


actgggattt 


attcttcagc 


1500 


ccgagtgaag 


acaacttcac caacgacctg 


aacaaaggtg 


aagaaatcac 


ctcagatact 


1560 


aacatcgaag 


cagccgaaga aaacatctcg 


ctggacctga 


tccagcagta 


ctacctgacc 


1620 


tttaatttcg 


acaacgagcc ggaaaacatt 


tctatcgaaa 


acctgagctc 


tgatatcatc 


1680 


ggccagctgg 


aactgatgcc gaacatcgaa 


cgtttcccaa 


acggtaaaaa 


gtacgagctg 


1740 


gacaaatata 


ccatgttcca ctacctgcgc 


gcgcaggaat 


ttgaacacgg 


caaatcccgt 


1800 


atcgcactga ctaactccgt taacgaagct 


ctgctcaacc 


cgtcccgtgt 


atacaccttc 


1860 



wo 2006/059105 



131 



PCT/GB2005/004598 



ttctctagcg 


actacgtgaa 


aaaggtcaac aaagcgactg aagctgcaat gttcttgggt 


1 ft O A 

1920 


tgggttgaac 


agcttgttta 


tgattttacc gacgagacgt ccgaagtatc tactaccgac 




aaaattgcgg 


atatcactat 


catcatcccg tacatcggtc cggctctgaa cattggcaac 


0 A y1 A 

2040 


atgctgtaca 


aagacgactt 


cgttggcgca ctgatcttct ccggtgcggt gatcctgctg 


2100 


gagttcatcc 


cggaaatcgc 


catcccggta ctgggcacct ttgctctggt ttcttacatt 


2160 


gcaaacaagg 


ttctgactgt 


acaaaccatc gacaacgcgc tgagcaaacg taacgaaaaa 


2220 


tgggatgaag 


tttacaaata 


tatcgtgacc aactggctgg ctaaggttaa tactcagatc 


2280 


gacctcatcc 


gcaaaaaaat 


gaaagaagca ctggaaaacc aggcggaagc taccaaggca 


2340 


atcattaact 


accagtacaa 


ccagtacacc gaggaagaaa aaaacaacat caacttcaac 


2400 


atcgacgatc 


tgtcctctaa 


actgaacgaa tccatcaaca aagctatgat caacatcaac 


2460 


aagttcctga 


accagtgctc 


tgtaagctat ctgatgaact ccatgatccc gtacggtgtt 


2520 


aaacgtctgg 


aggacttcga 


tgcgtctctg aaagacgccc tgctgaaata catttacgac 


2580 


aaccgtggca 


ctctgatcgg 


tcaggttgat cgtctgaagg acaaagtgaa caatacctta 


2640 


tcgaccgaca 


tcccttttca 


gctcagtaaa tatgtcgata accaacgcct tttgtccact 


2700 


ctagactag 






2709 



<210> 76 
<211> 902 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 76 

Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
15 10 15 

Asn Gly Val Asp He Ala Tyr lie Lys He Pro Asn Ala Gly Gin Met 
20 25 30 

Gin Pro Val Lys Ala Phe Lys He His Asn Lys He Trp Val He Pro 
35 40 45 

Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 
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Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg lie Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



lie Val Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys lie Asn Val He Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



(Sly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 . 285 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 
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Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu lie Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys lie Asn lie Val Pro Lys Val 
370 375 380 



Asn Tyr Thr lie Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu lie Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly lie lie Thr Ser Lys Thr Lys Ser Asp Asp Asp Asp Lys 
435 440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Arg Lys Asn 
450 455 460 



Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
465 470 475 480 



Gly Gly Ser Ala Leu Val Leu Gin Cys lie Lys Val Asn Asn Trp Asp 
485 490 495 



Leu Phe Phe Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys 
500 505 510 



Gly Glu Glu lie Thr Ser Asp Thr Asn lie Glu Ala Ala Glu Glu Asn 
515 520 525 



lie Ser Leu Asp Leu lie Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp 
530 535 540 



Asn Glu Pro Glu Asn lie Ser lie Glu Asn Leu Ser Ser Asp lie lie 
545 550 555 560 



Gly Gin Leu Glu Leu Met Pro Asn lie Glu Arg Phe Pro Asn Gly Lys 
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565 



570 



575 



Lys Tyr Glu Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin 
580 585 590 



Glu Phe Glu His Gly Lys Ser Arg lie Ala Leu Thr Asn Ser Val Asn 
595 600 605 



Glu Ala Leu Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp 
610 615 620 



Tyr Val Lys Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly 
625 630 635 640 



Trp Val Glu Gin Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val 
645 650 655 



Ser Thr Thr Asp Lys lie Ala Asp lie Thr lie lie lie Pro Tyr lie 
660 665 670 



Gly Pro Ala Leu Asn He Gly Asn Met Leu Tyr Lys Asp Asp Phe Val 
675 680 685 



Gly Ala Leu He Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro 
690 695 700 



Glu He Ala He Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He 
705 710 715 720 



Ala Asn Lys Val Leu Thr Val Gin Thr He Asp Asn Ala Leu Ser Lys 
725 730 735 



Arg Asn Glu Lys Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp 
740 745 750 



Leu Ala Lys Val Asn Thr Gin He Asp Leu He Arg Lys Lys Met Lys 
755 760 765 



Glu Ala Leu Glu Asn Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr 
770 775 780 



Gin Tyr Asn Gin Tyr Thr Glu Glu Glu Lys Asn Asn He Asn Phe Asn 
785 790 795 800 



He Asp Asp Leu Ser Ser Lys Leu Asn Glu Ser He Asn Lys Ala Met 
805 810 815 
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lie Asn He Asn Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met 
820 825 830 



Asn Ser Met He Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala 
835 840 845 



Ser Leu Lys Asp Ala Leu Leu Lys Tyr He Tyr Asp Asn Arg Gly Thr 
850 855 860 



Leu He Gly Gin Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu 
865 870 875 880 



Ser Thr Asp He Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg 
885 890 895 



Leu Leu Ser Thr Leu Asp 
900 



<210> 77 
<211> 207 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 77 

ggatccacgc acgtcgacgg catcattacc tccaaaacta aatctctgat agaaggtaga 60 
tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaac gtaagaacca ggcgctagcg 120 
ggcggtggcg gtagcggcgg tggcggtagc ggcggtggcg gtagcgcact agtgctgcag 180 
acgcacggtc tagaatgata aaagctt 207 

<210> 78 
<211> 2742 
<212> DNA 

<213> Artificial Sec[uence 
<220> 

<223> Synthetic 
<400> 78 

ggatccgaat tcatgccgat caccatcaac aacttcaact acagcgatcc ggtggataac 60 
aaaaacatcc tgtacctgga tacccatctg aataccctgg cgaacgaacc ggaaaaagcg 120 
tttcgtatca ccggcaacat ttgggttatt ccggatcgtt ttagccgtaa cagcaacccg 180 
aatctgaata.aarccgqcacg fegfet^qciagceCcgaaiaagGgrgttattacga tccgaactat 240 
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ctgagcaccg 


atagcgataa 


agataccttc 


ctgaaagaaa 


tcatcaaact 


gttcaaacgc 


300 


atcaacagcc 


gtgaaattgg 


cgaagaactg 


atctatcgcc 


tgagcaccga 


tattccgttt 


360 


ccgggcaaca 


acaacacccc 


gatcaacacc 


tttgatttcg 


atgtggattt 


caacagcgtt 


420 


gatgttaaaa 


cccgccaggg 


taacaattgg 


gtgaaaaccg 


gcagcattaa 


cccgagcgtg 


480 


attattaccg 


gtccgcgcga 


aaacattatt 


gatccggaaa 


ccagcacctt 


taaactgacc 


540 


aacaacacct 


ttgcggcgca 


ggaaggtttt 


ggcgcgctga 


gcattattag 


cattagcccg 


600 


cgctttatgc 


tgacctatag 


caacgcgacc 


aacgatgttg 


gtgaaggccg 


tttcagcaaa 


660 


agcgaatttt 


gcatggaccc 


gatcctgatc 


ctgatgcatg 


aactgaacca 


tgcgatgcat 


720 


aacctgtatg 


gcatcgcgat 


tccgaacgat 


cagaccatta 


gcagcgtgac 


cagcaacatc 


780 


ttttacagcc 


agtacaacgt 


gaaactggaa 


tatgcggaaa 


tctatgcgtt 


tggcggtccg 


840 


accattgatc 


tgattccgaa 


aagcgcgcgc 


aaatacttcg 


aagaaaaagc 


gctggattac 


900 


tatcgcagca 


ttgcgaaacg 


tctgaacagc 


attaccaccg 


cgaatccgag 


cagcttcaac 


960 


aaatatatcg 


gcgaatataa 


acagaaactg 


atccgcaaat 


atcgctttgt 


ggtggaaagc 


1020 


agcggcgaag 


ttaccgttaa 


ccgcaataaa 


ttcgtggaac 


tgtacaacga 


actgacccag 


1080 


atcttcaccg 


aatttaacta 


tgcgaaaatc 


tataacgtgc 


agaaccgtaa 


aatctacctg 


1140 


agcaacgtgt 


ataccccggt 


gaccgcgaat 


attctggatg 


ataacgtgta 


cgatatccag 


1200 


aacggcttta 


acatcccgaa 


aagcaacctg 


aacgttctgt 


ttatgggcca 


gaacctgagc 


1260 


cgtaatccgg 


cgctgcgtaa 


agtgaacccg 


gaaaacatgc 


tgtacctgtt 


caccaaattt 


1320 


tgcgtcgacg 


gcatcattac 


ctccaaaact 


aaatctctga 


tagaaggtag 


atttggcggt 


1380 


ttcacgggcg 


cacgcaaatc 


agcgcgtaaa 


cgtaagaacc 


aggcgctagc 


gggcggtggc 


1440 


ggtagcggcg 


gtggcggtag 


cggcggtggc 


ggtagcgcac 


tagtgctgca 


gtgtcgtgaa 


1500 


ctgctggtga 


aaaacaccga 


tctgccgttt 


attggcgata 


tcagcgatgt 


gaaaaccgat 


1560 


atcttcctgc 


gcaaagatat 


caacgaagaa 


accgaagtga 


tctactaccc 


ggataacgtg 


1620 


agcgttgatc 


aggtgatcct 


gagcaaaaac 


accagcgaac 


atggtcagct 


ggatctgctg 


1680 


tatccgagca 


ttgatagcga 


aagcgaaatt 


ctgccgggcg 


aaaaccaggt 


gttttacgat 


1740 


aaccgtaccc 


agaacgtgga 


ttacctgaac 


agctattact 


acctggaaag 


ccagaaactg 


1800 


agcgataacg 


tggaagattt 


tacctttacc 


cgcagcattg 


aagaagcgct 


ggataacagc 


1860 


gcgaaagttt 


acacctattt 


tccgaccctg 


gcgaacaaag 


ttaatgcggg 


tgttcagggc 


1920 


ggtctgtttc 


tgatgtgggc 


gaacgatgtg 


gtggaagatt 


tcaccaccaa 


catcctgcgt 


1980 


aaagataccc 


tggataaaat 


cagcgatgtt 


agcgcgatta 


ttccgtatat 


tggtccggcg 


2040 


ctgaacatta gcaatagcgt 


gcgtcgtggc 


aattttaccg 


aagcgtttgc 


ggttaccggt 


2100 
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gtgaccattc 


tgctggaagc 


gtttccggaa 


tttaccattc 


cggcgctggg 


tgcgtttgtg 


2160 


atctatagca 


aagtgcagga 


acgcaacgaa 


atcatcaaaa 


ccatcgataa 


ctgcctggaa 


2220 


cagcgtatta 


aacgctggaa 


agatagctat 


gaatggatga 


tgggcacctg 


gctgagccgt 


2280 


attatcaccc 


agttcaacaa 


catcagctac 


cagatgtacg 


atagcctgaa 


ctatcaggcg 


2340 


ggtgcgatta 


aagcgaaaat 


cgatctggaa 


tacaaaaaat 


acagcggcag 


cgataaagaa 


2400 


aacatcaaaa 


gccaggttga 


aaacctgaaa 


aacagcctgg 


atgtgaaaat 


tagcgaagcg 


2460 


atgaataaca 


tcaacaaatt 


catccgcgaa 


tgcagcgtga 


cctacctgtt 


caaaaacatg 


2520 


ctgccgaaag 


tgatcgatga 


actgaacgaa 


tttgatcgca 


acaccaaagc 


gaaactgatc 


2580 


aacctgatcg 


atagccacaa 


cattattctg 


gtgggcgaag 


tggataaact 


gaaagcgaaa 


2640 


gttaacaaca 


gcttccagaa 


caccatcccg 


tttaacatct 


tcagctatac 


caacaacagc 


2700 


ctgctgaaag 


atatcatcaa 


cgaatacttc 


aatctagact 


ag 




2742 



<210> 79 
<211> 913 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 79 

t 

Gly Ser Glu Phe Met Pro He Thr He Asn Asn Phe Asn Tyr Ser Asp 
15 10 15 



Pro Val Asp Asn Lys Asn He Leu Tyr Leu Asp Thr His Leu Asn Thr 
20 25 30 



Leu Ala Asn Glu Pro Glu Lys Ala Phe Arg He Thr Gly Asn He Trp 
35 40 45 



Val He Pro Asp Arg Phe Ser Arg Asn Ser Asn Pro Asn Leu Asn Lys 
50 55 60 



Pro Pro Arg Val Thr Ser Pro Lys Ser Gly Tyr Tyr Asp Pro Asn Tyr 
65 70 75 80 



Leu Ser Thr Asp Ser Asp Lys Asp Thr Phe Leu Lys Glu He He Lys 
85 90 95 



Leu Phe Lys Arg He Asn Ser Arg Glu He Gly Glu Glu Leu He Tyr 

.-^i. .>3rOat ,^ ^ .... : ..105. • 110 
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Arg Leu Ser Thr Asp He Pro Phe Pro Gly Asn Asn Asn Thr Pro He 
115 120 125 



Asn Thr Phe Asp Phe Asp Val Asp Phe Asn Ser Val Asp Val Lys Thr 
130 135 140 



Arg Gin Gly Asn Asn Trp Val Lys Thr Gly Ser He Asn Pro Ser Val 
145 150 155 160 



He He Thr Gly Pro Arg Glu Asn He He Asp Pro Glu Thr Ser Thr 
165 170 175 



Phe Lys Leu Thr Asn Asn Thr Phe Ala Ala Gin Glu Gly Phe Gly Ala 
180 185 190 



Leu Ser He He Ser He Ser Pro Arg Phe Met Leu Thr Tyr Ser Asn 
195 200 205 



Ala Thr Asn Asp Val Gly Glu Gly Arg Phe Ser Lys Ser Glu Phe Cys 
210 215 220 



Met Asp Pro He Leu He Leu Met His Glu Leu Asn His Ala Met His 
225 230 235 240 



Asn Leu Tyr Gly He Ala He Pro Asn Asp Gin Thr He Ser Ser Val 
245 250 255 



Thr Ser Asn He Phe Tyr Ser Gin Tyr Asn Val Lys Leu Glu Tyr Ala 
260 265 270 



Glu He Tyr Ala Phe Gly Gly Pro Thr He Asp Leu He Pro Lys Ser 
275 280 285 



Ala Arg Lys Tyr Phe Glu Glu Lys Ala Leu Asp Tyr Tyr Arg Ser He 
290 295 300 



Ala Lys Arg Leu Asn Ser He Thr Thr Ala Asn Pro Ser Ser Phe Asn 
305 310 315 320 



Lys Tyr He Gly Glu Tyr Lys Gin Lys Leu He Arg Lys Tyr Arg Phe 
325 330 335 



Val Val Glu Ser Ser Gly Glu Val Thr Val Asn Arg Asn Lys Phe Val 
340 345 350 
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Glu Leu Tyr Asn Glu Leu Thr Gin He Phe Thr Glu Phe Asn Tyr Ala 
355 360 365 



Lys He Tyr Asn Val Gin Asn Arg Lys He Tyr Leu Ser Asn Val Tyr 
370 375 380 



Thr Pro Val Thr Ala Asn He Leu Asp Asp Asn Val Tyr Asp He Gin 
385 390 395 400 



Asn Gly Phe Asn He Pro Lys Ser Asn Leu Asn Val Leu Phe Met Gly 
405 410 415 



Gin Asn Leu Ser Arg Asn Pro Ala Leu Arg Lys Val Asn Pro Glu Asn 
420 425 430 



Met Leu Tyr Leu Phe Thr Lys Phe Cys Val Asp Gly He He Thr Ser 
435 440 445 



Lys Thr Lys Ser Leu He Glu Gly Arg Phe Gly Gly Phe Thr Gly Ala 
450 455 460 



Arg Lys Ser Ala Arg Lys Arg Lys Asn Gin Ala Leu Ala Gly Gly Gly 
465 470 475 480 



Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu Val Leu 
485 490 495 



Gin Cys Arg Glu Leu Leu Val Lys Asn Thr Asp Leu Pro Phe He Gly 
500 505 510 



Asp He Ser Asp Val Lys Thr Asp He Phe Leu Arg Lys Asp He Asn 
515 520 525 



Glu Glu Thr Glu Val He Tyr Tyr Pro Asp Asn Val Ser Val Asp Gin 
530 535 540 



Val He Leu Ser Lys Asn Thr Ser Glu His Gly Gin Leu Asp Leu Leu 
545 550 555 560 



Tyr Pro Ser He Asp Ser Glu Ser Glu He Leu Pro Gly Glu Asn Gin 
565 570 575 



Val Phe Tyr Asp Asn Arg Thr Gin Asn Val Asp Tyr Leu Asn Ser Tyr 
580 585 590 



Tyr Tyr Leu Glu Ser Gin Lys Leu Ser Asp Asn Val Glu Asp Phe Thr 
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595 600 605 



Phe Thr Arg Ser lie Glu Glu Ala Leu Asp Asn Ser Ala Lys Val Tyr 
610 615 620 



Thr Tyr Phe Pro Thr Leu Ala Asn Lys Val Asn Ala Gly Val Gin Gly 
625 630 635 640 



Gly Leu Phe Leu Met Trp Ala Asn Asp Val Val Glu Asp Phe Thr Thr 
645 650 655 



Asn lie Leu Arg Lys Asp Thr Leu Asp Lys lie Ser Asp Val Ser Ala 
660 665 670 



lie He Pro Tyr He Gly Pro Ala Leu Asn He Ser Asn Ser Val Arg 
675 680 685 



Arg Gly Asn Phe Thr Glu Ala Phe Ala Val Thr Gly Val Thr He Leu 
690 695 700 



Leu Glu Ala Phe Pro Glu Phe Thr He Pro Ala Leu Gly Ala Phe Val 
705 710 715 720 



He Tyr Ser Lys Val Gin Glu Arg Asn Glu He He Lys Thr He Asp 
725 730 735 



Asn Cys Leu Glu Gin Arg He Lys Arg Trp Lys .Asp Ser Tyr Glu Trp 
740 745 750 



Met Met Gly Thr Trp Leu Ser Arg He He Thr Gin Phe Asn Asn He 
755 760 765 



Ser Tyr Gin Met Tyr Asp Ser Leu Asn Tyr Gin Ala Gly Ala He Lys 
770 775 780 



Ala Lys He Asp Leu Glu Tyr Lys Lys Tyr Ser Gly Ser Asp Lys* Glu 
785 790 795 800 



Asn He Lys Ser Gin Val Glu Asn Leu Lys Asn Ser Leu Asp Val Lys 
805 810 815 



He Ser Glu Ala Met Asn Asn He Asn Lys Phe He Arg Glu Cys Ser 
820 825 830 



Val Thr Tyr Leu Phe Lys Asn Met Leu Pre Lys Val He Asp Glu Leu 
835 840 845 
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Asn Glu Phe Asp Arg Asn Thr Lys Ala Lys Leu He Asn Leu He Asp 
850 855 860 

Ser His Asn He He Leu Val Gly Glu Val Asp Lys Leu Lys Ala Lys 
865 870 875 880 

Val Asn Asn Ser Phe Gin Asn Thr He Pro Phe Asn He Phe Ser Tyr 
885 890 895 

Thr Asn Asn Ser Leu Leu Lys Asp He He Asn Glu Tyr Phe Asn Leu 
900 905 910 

Asp 



<210> 80 
<211> 2673 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 80 

ggatccatgg agttcgttaa caaacagttc aactataaag acccagttaa cggtgttgac 60 

attgcttaca tcaaaatccc gaacgctggc cagatgcagc cggtaaaggc attcaaaatc 120 

cacaacaaaa tctgggttat cccggaacgt gataccttta ctaacccgga agaaggtgac 180 

ctgaacccgc caccggaagc gaaacaggtg ccggtatctt actatgactc cacctacctg 240 

tctaccgata acgaaaagga caactacctg aaaggtgtta ctaaactgtt cgagcgtatt 300 

tactccaccg acctgggccg tatgctgctg actagcatcg ttcgcggtat cccgttctgg 360 

ggcggttcta ccatcgatac cgaactgaaa gtaatcgaca ctaactgcat caacgttatt 420 

cagccggacg gttcctatcg ttccgaagaa ctgaacctgg tgatcatcgg cccgtctgct 480 

gatatcatcc agttcgagtg taagagcttt ggtcacgaag ttctgaacct cacccgtaac 540 

ggctacggtt ccactcagta catccgtttc tctccggact tcaccttcgg ttttgaagaa 600 

tccctggaag tagacacgaa cccactgctg ggcgctggta aattcgcaac tgatcctgcg 660 

gttaccctgg ctcacgaact gattcatgca ggccaccgcc tgtacggtat cgccatcaat 720 

ccgaaccgtg tcttcaaagt taacaccaac gcgtattacg agatgtccgg tctggaagtt 780 

agcttcgaag aactgcgtac ttttggcggt cacgacgcta aattcatcga ctctctgcaa 840 

gaaaacgagt.ntccgt^r.tgta-..ctactab£aG.*.aa"gttf2aaa'g at caccctgaac - 900 
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aaagcgaaat ccatcgtggg 


taccactgct tctctccagt 


acatgaagaa 


cgtttttaaa 


960 


gaaaaatacc tgctcagcga 


agacacctcc ggcaaattct 


ctgtagacaa gttgaaattc 


1020 


gataaacttt acaaaatgct 


gactgaaatt tacaccgaag 


acaacttcgt 


taagttcttt 


1080 


aaagttctga accgcaaaac 


ctatctgaac ttcgacaagg 


cagtattcaa aatcaacatc 


1140 


gtgccgaaag ttaactacac 


tatctacgat ggtttcaacc 


tgcgtaacac caacctggct 


1200 


gctaatttta acggccagaa 


cacggaaatc aacaacatga 


acttcacaaa actgaaaaac 


1260 


ttcactggtc tgttcgagtt 


ttacaagctg ctgtgcgtcg 


acggcatcat 


tacctccaaa 


1320 


actaaatctc tgatagaagg 


tagatacggt ggtttcctgg 


cgctagcggg cggtggcggt 


1380 


agcggcggtg gcggtagcgg 


cggtggcggt agcgcactag 


tgctgcagtg tatcaaggtt 


1440 


aacaactggg atttattctt 


cagcccgagt gaagacaact 


tcaccaacga 


cctgaacaaa 


1500 


ggtgaagaaa tcacctcaga 


tactaacatc gaagcagccg 


aagaaaacat 


ctcgctggac 


1560 


ctgatccagc agtactacct 


gacctttaat ttcgacaacg 


agccggaaaa 


catttctatc 


1620 


gaaaacctga gctctgatat 


catcggccag ctggaactga 


tgccgaacat 


cgaacgtttc 


1680 


ccaaacggta aaaagtacga 


gctggacaaa tataccatgt 


tccactacct 


gcgcgcgcag 


1740 


gaatttgaac acggcaaatc 


ccgtatcgca ctgactaact 


ccgttaacga 


agctctgctc 


1800 


aacccgtccc gtgtatacac 


cttcttctct agcgactacg 


tgaaaaaggt 


caacaaagcg 


1860 


actgaagctg caatgttctt 


99rgttgggtt gaacagcttg 


tttatgattt 


taccgacgag 


1920 


acgtccgaag tatctactac 


cgacaaaatt gcggatatca 


ctatcatcat 


cccgtacatc 


1980 


ggtccggctc tgaacattgg 


caacatgctg tacaaagacg 


acttcgttgg 


cgcactgatc 


2040 


ttctccggtg cggtgatcct 


gctggagttc atcccggaaa 


tcgccatccc 


ggtactgggc 


2100 


acctttgctc tggtttctta 


cattgcaaac aaggttctga 


ctgtacaaac 


catcgacaac 


2160 


gcgctgagca aacgtaacga 


aaaatgggat gaagtttaca 


aatatatcgt gaccaactgg 


2220 


ctggctaagg ttaatactca 


gatcgacctc atccgcaaaa 


aaatgaaaga 


agcactggaa 


2280 


aaccaggcgg aagctaccaa 


ggcaatcatt aactaccagt 


acaaccagta 


caccgaggaa 


2340 


gaaaaaaaca acatcaactt 


caacatcgac gatctgtcct 


ctaaactgaa 


cgaatccatc 


2400 


aacaaagcta tgatcaacat 


caacaagttc ctgaaccagt 


gctctgtaag ctatctgatg 


2460 


aactccatga tcccgtacgg 


tgttaaacgt ctggaggact 


tcgatgcgtc 


tctgaaagac 


2520 


gccctgctga aatacattta 


cgacaaccgt ggcactctga 


tcggtcaggt 


tgatcgtctg 


2580 


aaggacaaag tgaacaatac 


cttatcgacc gacatccctt 


ttcagctcag taaatatgtc 


2640 


gataaccaac gccttttgtc 


cactctagac tag 






2673 
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<210> 


81 


<211> 


890 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


81 



Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
15 10 15 



Asn Gly Val Asp lie Ala Tyr lie Lys lie Pro Asn Ala Gly Gin Met 
20 25 30 



Gin Pro Val Lys Ala Phe Lys lie His Asn Lys lie Trp Val lie Pro 
35 40 45 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg lie Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



lie Val Arg Gly lie Pro Phe Trp Gly Gly Ser Thr lie Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
-i:^' -.M 195' * . : - 200 - ' 205 
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Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu lie His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp lie Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu He Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys He Asn He Val Pro Lys Val 
370 375 380 



Asn Tyr Thr He Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly He He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 440 445 
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Tyr Gly Gly Phe Leu Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly 
450 455 460 



Gly Ser Gly Gly Gly Gly Ser Ala Leu Val Leu Gin Cys lie Lys Val 
465 470 475 480 



Asn Asn Trp Asp Leu Phe Phe Ser Pro Ser Glu Asp Asn Phe Thr Asn 
485 490 495 



Asp Leu Asn Lys Gly Glu Glu lie Thr Ser Asp Thr Asn lie Glu Ala 
500 505 510 



Ala Glu Glu Asn lie Ser Leu Asp Leu He Gin Gin Tyr Tyr Leu Thr 
515 520 525 



Phe Asn Phe Asp Asn Glu Pro Glu Asn He Ser He Glu Asn Leu Ser 
530 535 540 



Ser Asp He He Gly Gin Leu Glu Leu Met Pro Asn He Glu Arg Phe 
545 550 555 560 



Pro Asn Gly Lys Lys Tyr Glu Leu Asp Lys Tyr Thr Met Phe His Tyr 
565 570 575 



Leu Arg Ala Gin Glu Phe Glu His Gly Lys Ser Arg He Ala Leu Thr 
580 585 590 



Asn Ser Val Asn Glu Ala Leu Leu Asn Pro Ser Arg Val Tyr Thr Phe 
595 600 605 



Phe Ser Ser Asp Tyr Val Lys Lys Val Asn Lys Ala Thr Glu Ala Ala 
610 615 620 



Met Phe Leu Gly Trp Val Glu Gin Leu Val Tyr Asp Phe Thr Asp Glu 
625 630 635 640 



Thr. Ser Glu Val Ser Thr Thr Asp Lys He Ala Asp He Thr He He 
645 650 655 



He Pro Tyr He Gly Pro Ala Leu Asn He Gly Asn Met Leu Tyr Lys 
660 665 670 



Asp Asp Phe Val Gly Ala Leu He Phe Ser Gly Ala Val He Leu Leu 
675 680 685 



Glu Phe He Pro Glu He Ala He Pro Val Leu Gly Thr Phe Ala Leu 
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690 695 700 



Val Ser Tyr He Ala Asn Lys Val Leu Thr Val Gin Thr He Asp Asn 
705 710 715 720 



Ala Leu Ser Lys Arg Asn Glu Lys Trp Asp Glu Val Tyr Lys Tyr He 
725 730 735 



Val Thr Asn Trp Leu Ala Lys Val Asn Thr Gin He Asp Leu He Arg 
740 745 750 



Lys Lys Met Lys Glu Ala Leu Glu Asn Gin Ala Glu Ala Thr Lys Ala 
755 760 765 



He He Asn Tyr Gin Tyr Asn Gin Tyr Thr Glu Glu Glu Lys Asn Asn 
770 775 780 



He Asn Phe Asn He Asp Asp Leu Ser Ser Lys Leu Asn Glu Ser He 
785 790 795 800 



Asn Lys Ala Met He Asn He Asn Lys Phe Leu Asn Gin Cys Ser Val 
805 810 815 



Ser Tyr Leu Met Asn Ser Met He Pro Tyr Gly Val Lys Arg Leu Glu 
820 825 830 



Asp Phe Asp Ala Ser Leu Lys Asp Ala Leu Leu Lys Tyr He Tyr Asp 
835 840 845 



Asn Arg Gly Thr Leu He Gly Gin Val Asp Arg Leu Lys Asp Lys Val 
850 855 860 



Asn Asn Thr Leu Ser Thr Asp He Pro Phe Gin Leu Ser Lys Tyr Val 
865 870 875 880 



Asp Asn Gin Arg Leu Leu Ser Thr Leu Asp 
885 890 



<210> 82 

<211> 2709 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<400> 82 

ggatccatgg agttcgttaa caaacagttc aactataaag acccagttaa cggtgttgac 60 
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4* -4* jr^r 4^ 4* ^ ^ ^ 


tcaaaat^ccc 


gaacgctggc 


cagatgcagc 


cggtaaaggc 


attcaaaatc 


1 OA 


cd.ccia.caaaa 


tccgggutat 


cc cggaacg t 


gataccttta 


ctaacccgga 


agaaggtgac 


1 Q A 


^tm ^ A 

Ct9^aCCC9C 


caccggaagc 


gaaacaggtg 


ccggtatctt 


actatgactc 


cacctacctg 


240 


tctaccgata 


acgaaaagga 


caactacctg 


aaaggtgtta 


ctaaactgtt 


cgagcgtatt 


300 


tactccaccg 


acctgggccg 


tatgctgctg 


actagcatcg 


ttcgcggtat 


cccgttctgg 


360 


ggcggttcta 


ccatcgatac 


cgaactgaaa 


gtaatcgaca 


ctaactgcat 


caacgttatt 


420 


cagccggacg 


gttcctatcg 


ttccgaagaa 


ctgaacctgg 


tgatcatcgg 


cccgtctgct 


480 


gatatcatcc 


agttcgagtg 


taagagcttt 


ggtcacgaag 


ttctgaacct 


cacccgtaac 


540 


ggctacggtt 


ccactcagta 


catccgtttc 


tctccggact 


tcaccttcgg 


ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


cccactgctg 


ggcgctggta 


aattcgcaac 


tgatcctgcg 


660 


gttaccctgg 


ctcacgaact 


gattcatgca 


ggccaccgcc 


tgtacggtat 


cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt 


taacaccaac 


gcgtattacg 


agatgtccgg 


tctggaagtt 


780 


agcttcgaag 


aactgcgtac 


ttttggcggt 


cacgacgcta 


aattcatcga 


ctctctgcaa 


840 


gaaaacgagt 


tccgtctgta 


ctactataac 


aagttcaaag 


atatcgcatc 


caccctgaac 


900 


aaagcgaaat 


ccatcgtggg 


taccactgct 


tctctccagt 


acatgaagaa 


cgtttttaaa 


960 


gaaaaatacc 


tgctcagcga 


agacacctcc 


ggcaaattct 


ctgtagacaa 


gttgaaattc 


1020 


gataaacttt 


acaaaatgct 


gactgaaatt 


tacaccgaag 


acaacttcgt 


taagttcttt 


1080 


aaagttctga 


accgcaaaac 


ctatctgaac 


ttcgacaagg 


cagtattcaa 


aatcaacatc 


1140 


gtgccgaaag 


ttaactacac 


tatctacgat 


ggtttcaacc 


tgcgtaacac 


caacctggct 


1200 


gctaatttta 


acggccagaa 


cacggaaatc 


aacaacatga 


acttcacaaa 


actgaaaaac 


1260 


ttcactggtc 


tgttcgagtt 


ttacaagctg 


ctgtgcgtcg 


acggcatcat 


tacctccaaa 


1320 


actaaatctc 


tgatagaagg 


tagatatggc 


ggtttcacgg 


gcgcacgcaa 


atcagcgcgt 


1380 


aaattagcta 


accaggcgct 


agcgggcggt 


ggcggtagcg 


gcggtggcgg 


tagcggcggt 


1440 


ggcggtagcg 


cactagtgct 


gcagtgtatc 


aaggttaaca 


actgggattt 


attcttcagc 


1500 


ccgagcgaag 


acaactticac 


caacgacctg 


aacaaaggtg 


aagaaatcac 


ctcagatact 


1560 


aaca^cgaag 


cagccgaaga 


aaacatctcg 


ctggacctga 


tccagcagta 


ctacctgacc 


1620 


tttaatttcg 


acaacgagcc 


ggaaaacatt 


tctatcgaaa 


acctgagctc 


tgatatcatc 


1680 


ggccagctgg 


aactgatgcc 


gaacatcgaa 


cgtttcccaa 


acggtaaaaa 


gtacgagctg 


1740 


gacaaatata 


ccatgttcca 


ctacctgcgc 


gcgcaggaat 


ttgaacacgg 


caaatcccgt 


1800 


atcgcactga 


ctaactccgt- 


taacgaagct 


ctgctcaacc 


cgtcccgtgt 


atacaccttc 


1860 
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ttctctagcg 


actacgtgaa 


aaaggtcaac aaagcgactg aagctgcaat 


gttcttgggt 


1920 


tgggttgaac 


agcttgttta 


tgattttacc gacgagacgt 


ccgaagtatc 


tactaccgac 


1980 


aaaattgcgg 


atatcactat 


catcatcccg tacatcggtc 


cggctctgaa 


cattggcaac 


2040 


atgctgtaca 


aagacgactt 


cgttggcgca ctgatcttct 


ccggtgcggt 


gatcctgctg 


2100 


gagttcatcc 


cggaaatcgc 


catcccggta ctgggcacct 


ttgctctggt 


ttcttacatt 


2160 


gcaaacaagg 


ttctgactgt 


acaaaccatc gacaacgcgc 


tgagcaaacg taacgaaaaa 


2220 


tgggatgaag 


tttacaaata 


tatcgtgacc aactggctgg 


ctaaggttaa 


tactcagatc 


2280 


gacctcatcc 


gcaaaaaaat 


gaaagaagca ctggaaaacc 


aggcggaagc 


taccaaggca 


2340 


atcattaact 


accagtacaa 


ccagtacacc gaggaagaaa aaaacaacat 


caacttcaac 


2400 


atcgacgatc 


tgtcctctaa 


actgaacgaa tccatcaaca aagctatgat 


caacatcaac 


2460 


aagttcctga 


accagtgctc 


tgtaagctat ctgatgaact 


ccatgatccc 


gtacggtgtt 


2520 


aaacgtctgg 


aggacttcga 


tgcgtctctg aaagacgccc 


tgctgaaata 


catttacgac 


2580 


aaccgtggca 


ctctgatcgg 


tcaggttgat cgtctgaagg 


acaaagtgaa 


caatacctta 


2640 


tcgaccgaca 


tcccttttca 


gctcagtaaa tatgtcgata accaacgcct 


tttgtccact 


2700 


ctagactag 










2709 



<210> 83 

<211> 902 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 83 

Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
1 5 10 ■ 15 

Asn Gly Val Asp lie Ala Tyr lie Lys lie Pro Asn Ala Gly Gin Met 
20 25 30 

Gin Pro Val Lys Ala Phe Lys lie His Asn Lys lie Trp Val lie Pro 
35 40 45 

Glu Arg Asp Thr ' Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 
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Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg lie Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



lie Val Arg Gly lie Pro Phe Trp Gly Gly Ser Thr lie Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
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325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu lie Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys He Asn He Val Pro Lys Val 
370 375 380 



Asn Tyr Thr He Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly He He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 440 445 



Tyr Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Leu Ala Asn 
450 455 460 



Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
465 470 475 480 



Gly Gly Ser Ala Leu Val Leu Gin Cys He Lys Val Asn Asn Trp Asp 
485 490 495 



Leu Phe Phe Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys 
500 505 510 



Gly Glu Glu He Thr Ser Asp Thr Asn He Glu Ala Ala Glu Glu Asn 
515 520 525 



He Ser Leu Asp Leu He Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp 
530 535 540 



Asn Glu Pro Glu Asn He Ser He Glu Asn Leu Ser Ser Asp He He 
545 550 555 560 



Gly Gin Leu Glu Leu Met Pro., Asn lie Glu Arg Phe Pro Asn Gly Lys 
565 570 575 
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Lys Tyr Glu Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin 
580 585 590 



Glu Phe Glu His Gly Lys Ser Arg lie Ala Leu Thr Asn Ser Val Asn 
595 600 605 



Glu Ala Leu Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp 
610 615 620 



Tyr Val Lys Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly 
625 630 635 640 



Trp Val Glu Gin Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val 
645 650 655 



Ser Thr Thr Asp Lys He Ala Asp He Thr He He He Pro Tyr He 
660 665 670 



Gly Pro Ala Leu Asn He Gly Asn Met Leu Tyr Lys Asp Asp Phe Val 
675 680 685 



Gly Ala Leu He Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro 
690 695 700 



Glu He Ala He Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He 
705 710 715 720 



Ala Asn Lys Val Leu Thr Val Gin Thr He Asp Asn Ala Leu Ser Lys 
725 730 735 



Arg Asn Glu Lys Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp 
740 745 750 



Leu Ala Lys Val Asn Thr Gin He Asp Leu He Arg Lys Lys Met Lys 
755 760 765 



Glu Ala Leu Glu Asn Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr 
770 775 780 



Gin Tyr Asn Gin Tyr Thr Glu Glu Glu Lys Asn Asn He Asn Phe Asn 
785 790 795 800 



He Asp Asp Leu Ser Ser Lys Leu Asn Glu Ser He Asn Lys Ala Met 

~ -J' '! -^605 . . - .-.^.j .1. , .310 -r - . - 815 
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lie Asn lie Asn Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met 
820 825 830 



Asn Ser Met lie Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala 
835 840 845 



Ser Leu Lys Asp Ala Leu Leu Lys Tyr lie Tyr Asp Asn Arg Gly Thr 
850 855 860 



Leu He Gly Gin Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu 
865 870 875 880 



Ser Thr Asp He Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg 
885 890 895 



Leu Leu Ser Thr Leu Asp 
900 



<210> 84 

<211> 2709 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Synthetic 

<400> 84 



ggatccatgg 


agttcgttaa 


caaacagttc aactataaag 


acccagttaa 


cggtgttgac 


60 


attgcttaca 


tcaaaatccc 


gaacgctggc cagatgcagc 


cggtaaaggc 


attcaaaatc 


120 


cacaacaaaa 


tctgggttat 


cccggaacgt gataccttta 


ctaacccgga 


agaaggtgac 


180 


ctgaacccgc 


caccggaagc 


gaaacaggtg ccggtatctt 


actatgactc 


cacctacctg 


240 


tctaccgata 


acgaaaagga 


caactacctg aaaggtgtta 


ctaaactgtt 


cgagcgtatt 


300 


tactccaccg 


acctgggccg 


tatgctgctg actagcatcg 


ttcgcggtat 


cccgttctgg 


360 


ggcggttcta 


ccatcgatac 


cgaactgaaa gtaatcgaca 


ctaactgcat 


caacgttatt 


420 


cagccggacg 


gttcctatcg 


ttccgaagaa ctgaacctgg 


tgatcatcgg 


cccgtctgct 


480 


gatatcatcc 


agttcgagtg 


taagagcttt ggtcacgaag 


ttctgaacct 


cacccgtaac 


540 


ggctacggtt 


ccactcagta 


catccgtttc tctccggact 


tcaccttcgg 


ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


cccactgctg ggcgctggta 


aattcgcaac 


tgatcctgcg 


660 


gttaccctgg 


ctcacgaact 


gattcatgca ggccaccgcc 


tgtacggtat 


cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt 


taacaccaac gcgtattacg 


agatgtccgg 


tctggaagtt 


780 


agcttcgaag 


aactgcgtac 


ttttggcggt cacgacgcta 


aattcatcga 


ctctctgcaa 


840 
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tccgtclig'ba 


ccacuacaac 


aagcucaaag 


auaccgcauc 


caccc^gaac 






ccatcgtggg 


taccac^gct 


tctctccagt: 


acatgaagaa 


cgcu uccaaa 


C\ /T f\ 




cgcccagcga 


agacaccucc 


ggcaaauCCu 


ccgcagacaa 


guugaaaccc 




9at:aaactct 


acaaaatgct 


gactgaaatt 


tacaccgaag 


acaacttcgt 


taagccccct 


1080 


aaagt^tctga 


accgcaaaac 


ctatctgaac 


ttcgacaagg 


cagtattcaa 


aatcaacatc 


1140 


gtgccgaaag 


ttaactacac 


tatctacgat 


ggtttcaacc 


tgcgtaacac 


caacctggct 


1200 


gctaatttta 


acggccagaa 


cacggaaatc 


aacaacatga 


acttcacaaa 


actgaaaaac 


1260 


ttcactggtc 


tgttcgagtt 


ttacaagctg 


ctgtgcgtcg 


acggcatcat 


tacctccaaa 


1320 


actaaatctc 


tgatagaagg 


tagatatggc 


ggtttcacgg 


gcgcacgcaa 


atcagcgcgt 


1380 


aaacgtaaga 


accaggcgct 


agcgggcggt 


ggcggtagcg 


gcggtggcgg 


tagcggcggt 


1440 


ggcggtagcg 


cactagtgct 


gcagtgtatc 


aaggttaaca 


actgggattt 


attcttcagc 


1500 


ccgagtgaag 


acaacttcac 


caacgacctg 


aacaaaggtg 


aagaaatcac 


ctcagatact 


1560 


aacatcgaag 


cagccgaaga 


aaacatctcg 


ctggacctga 


tccagcagta 


ctacctgacc 


1620 


tttaatttcg 


acaacgagcc 


ggaaaacatt 


tctatcgaaa 


acctgagctc 


tgatatcatc 


1680 


ggccagctgg 


aactgatgcc 


gaacatcgaa 


cgtttcccaa 


acggtaaaaa 


gtacgagctg 


1740 


gacaaatata 


ccatgttcca 


ctacctgcgc 


gcgcaggaat 


ttgaacacgg 


caaatcccgt 


1800 


atcgcactga 


ctaactccgt 


taacgaagct 


ctgctcaacc 


cgtcccgtgt 


atacaccttc 


1860 


ttctctagcg 


actacgtgaa 


aaaggtcaac 


aaagcgactg 


aagctgcaat 


gttcttgggt 


1920 


tgggttgaac 


agcttgttta 


■ till — _■ 

tgattttacc 


gacgagacgt 


ccgaagtatc 


tactaccgac 


1980 


aaaattgcgg 


atatcactat 


catcatcccg 


tacatcggtc 


cggctctgaa 


cattggcaac 


2040 


atgctgtaca 


aagacgactt: 


cgttggcgca 


ctgatcttct 


ccggtgcggt 


gatcctgctg 


2100 


gagttcatcc 


cggaaatcgc 


catcccggta 


cbgggcacct 


ttgctctggt 


ctcttacatt 


2160 


gcaaacaagg 


ttctgactgt 


acaaaccatc 


gacaacgcgc 


tgagcaaacg 


taacgaaaaa 


2220 


tgggatgaag 


^ttacaaata 


tatcgtgacc 


aactggctgg 


ctaaggtbaa 


tactcagatc 


2280 


gacctcat^cc 


gcaaaaaaat 


g aaagaagc a 


ctggaaaacc 


aggcggaagc 


taccaaggca 


2340 


aucacuaacc 


accagtacaa 


ccagtacacc 


gaggaagaaa 


aaaacaacat 


caacttcaac 


2400 


atcgacgatc 


tgtcctctaa 


actgaacgaa 


tccatcaaca 


aagctatgat 


caacatcaac 


2460 


aagttcctga 


accagtgctc 


tgtaagctat 


ctgatgaact 


ccatgatccc 


gtacggtgtt 


2520 


aaacgtctgg 


aggacttcga 


tgcgtctctg 


aaagacgccc 


tgctgaaata 


catttacgac 


2580 


aaqcgtggca 


.ctctgaccgg 


tcaggttgat 


cgtctgaagg 


acaaagtgaa 


caatacctts 


2640 
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tcgaccgaca tcccttttca gctcagtaaa tatgtcgata accaacgcct tttgtccact 2700 
ctagactag 2709 



<210> 85 
<211> 902 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 85 

Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
15 10 15 



Asn Gly Val Asp He Ala Tyr He Lys He Pro Asn Ala Gly Gin Met 
20 25 30 



Gin Pro Val Lys Ala Phe Lys He His Asn Lys He Trp Val He Pro 
35 40 45 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 • 70 75 80 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg He Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



He Val Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



li^'J^h^ Arg^Tj^an.cGiy 5Lyr GJ.y Ser Thr. -Gin. Tyi:. lie; Arg Phe Ser Pro 
180 185 190 
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Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu lie His Ala Gly His Arg Leu Tyr Gly lie Ala lie Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe lie Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp lie Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



lie Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu lie Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys lie Asn lie Val Pro Lys Val 
370 375 380 



Asn Tyr Thr lie Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu lie Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
.s.^'.r "'420 : . ^ 425'-';pj .'.1 m * -' 430 
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Val Asp Gly lie He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 440 445 



Tyr Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Arg Lys Asn 
450 455 460 



Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
465 470 475 480 



Gly Gly Ser Ala Leu Val Leu Gin Cys He Lys Val Asn Asn Trp Asp 
485 490 495 



Leu Phe Phe Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys 
500 505 510 



Gly Glu Glu He Thr Ser Asp Thr Asn He Glu Ala Ala Glu Glu Asn 
515 520 525 



He Ser Leu Asp Leu He Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp 
530 535 540 



Asn Glu Pro Glu Asn He Ser He Glu Asn Leu Ser Ser Asp He He 
545 550 555 560 



Gly Gin Leu Glu Leu Met Pro Asn He Glu Arg Phe Pro Asn Gly Lys 
565 570 575 



Lys Tyr Glu Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin 
580 585 590 



Glu Phe Glu His Gly Lys Ser Arg He Ala Leu Thr Asn Ser Val Asn 
595 600 605 



Glu Ala Leu Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp 
610 615 620 



Tyr Val Lys Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly 
625 630 635 640 



Trp Val Glu Gin Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val 
645 650 655 



Ser Thr Thr Asp Lys He Ala Asp He Thr He He He Pro Tyr He 
660 665 670 
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Gly Pro Ala Leu Asn lie Gly Asn Met Leu Tyr Lys Asp Asp Phe Val 
675 680 685 



Gly Ala Leu lie Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro 
690 695 700 



Glu He Ala He Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He 
705 710 715 720 



Ala Asn Lys Val Leu Thr Val Gin Thr He Asp Asn Ala Leu Ser Lys 
725 730 735 



Arg Asn Glu Lys Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp 
740 745 750 



•Leu Ala Lys Val Asn Thr Gin He Asp Leu He Arg Lys Lys Met Lys 
755 760 765 



Glu Ala Leu Glu Asn Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr 
770 775 780 



Gin Tyr Asn Gin Tyr Thr Glu Glu Glu Lys Asn Asn He Asn Phe Asn 
785 790 795 800 



He Asp Asp Leu Ser Ser Lys Leu Asn Glu Ser He Asn Lys Ala Met 
805 810 815 



lie Asn He Asn Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met 
820 825 830 



Asn Ser Met He Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala 
835 840 845 



Ser Leu Lys Asp Ala Leu Leu Lys Tyr He Tyr Asp Asn Arg Gly Thr 
850 855 860 



Leu He Gly Gin Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu 
865 870 875 880 



Ser Thr Asp He Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg 
885 890 895 



Leu Leu Ser Thr Leu Asp 
900 



<210> 86 
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<211> 2889 
<212> DNA 

<213> Artificial Sequence 








<220> 

<223> Synthetic 










<400> 86 
acratccttcro 


tacgagatga 


cgttgactat 


caaattttcc 


gcgactttgc ggaaaataaa 


60 


ciah aacrt* 1 1 1 


tccitcacrcac 

w>v*y u^yy ^i^y %•» 


cacagacctg 


tccgtcaaaa 


ataagagagg ccagaacatc 


120 


crcrt' a a r'crr' a c 


tcracf paac at" 

L»y »y i«»ci»v*y t«» 


ccctatgatt 


gattttagtg 


tagcggacgt taataaacgg 


180 


a ^ ^ crr»a a ocf 

Cl l« Cy ^ C1CIIm> y 


L»Vi>y 1* uy ci 


gcagtatgct 


gtcagcgtca 


aacatgctaa agcggaagtt 


240 




at* ^ a pacicf OA 


atataacggc 


cataacgatg 


tggctgataa agaaaatgaa 


300 


^ a ^ r^r^r*<*Tl~ net 


t" r^rra rf r« a rra a 
i-\.>y cLy ^ciy clci 


caattacgaa 


ccgcacaaag 


cgtggggcgc gagtaattta 


360 




arrna r*^ at* a a 


catggcccgt 


ttcaataaat 


tcgtgaccga ggtagcaccg 


420 




r»a fra t" ctc^ i~ act 
vociy ciuy w> v^yy 


tgggggcctg 


gatacctaca 


aagataaaaa ccgcttctct 


480 




frr»ai" f"orfr«rTr» 
y ^ ci u uy y wy 


cggtcgtcag 


ctcgtgtacg 


agaagggtgt ctatcaccag 


540 


na a nt\ ^ s a ^ rt 


a a a a ctrtnct^ a 
actaayy y y 


cgacctccgt 


gatttgtccc 


aggcgtatcg ctacgctatt 


600 




r»rr1~ a t~ a a a np* 
^y ud i.cicici.y d 


tattaatatc 


gatcaaacca 


tgaataccga aggcctaatt 


660 


y y t. u t. uy y y « 


at*r*ai*aa'^aa 


gcaatatagc 


gcagaagagc 


taaagcaggc cctcagccaa 


720 




r«r*aa +" t" a r*ftrt 
1^ ^ ctci u A V* y y 


agtgttaggc 


gatagcggca 


gtccgctgtt tgccttcgat 


780 


a a a *^a na a a a 
aacLwayadaa 


ai* ^a airtcKT^ 
ctuLtCiciwyyy i» 


gtttctgggc 


acttatgatt 


attgggccgg atatggtaaa 


840 


a a rt ?^ rr r« t" rrrr 


a cfCf a a t* Cf era a 

^yy ^ yy 


tatttataaa 


aaggaattcg 


cagacaaaat caagcagcat 


900 


y A Va> ci ci w y ^ ay 


CI t" a c crci t era a 

y UAwyy i»y»M 


ggggaacggc 


gaacatcact 


ggaagacgac cggcacgaat 


960 


oat" atecf 

ciy k>>M niicCi wv»y 


cratcaaccrac 


cgttcgcctg 


gcgaacaatg 


agggcgatgc aaacaatggg 


1020 


c a.aa.a.ccr t cr a 


cctttgagga 


caacggtacc 


ctggtcctta 


accagaacat aaatcagggc 


1080 


crccrcrcra.aac t 


tgttctttaa 


aggcgactat 


actgttaagg 


gagcaaacaa tgacatcacc 


1140 


tcrcrt t aoaca 


ccQQtattcra 


cgttgcggat 


ggaaaaaagg 


tggtttggca ggttaaaaac 


1200 


cctaacgggg 


accggctggc 


aaaaatcggc 


aaagggacat 


tggaaattaa tggtaccggt 


1260 


gtgaatcagg 


gtcagctgaa 


agtgggagat 


gggaccgtga 


ttctgaacca gaaagcagac 


1320 


gctgacaaaa 


aggtgcaagc 


ctttagccaa 


gtaggaattg 


ttagtggtcg tggcacactc 


1380 


gtcttgaact 


caagcaacca 


aataaatccg 


gataacctgt 


actttggatt tcgtggcgga 


1440 


cgcctggatg 


Gtaacgggaa 


tgatctgacc 


tttgaacata 


tccgtaacgt tgacgagggt 


1500 


gcgcgcatag 

•I 


ttaatcataa 


tactgaccat 


gcatcaacta 


tcaccttgac cgggaaaagt 


1560 


ctgattacaa 


acccaaactc 


tctgtcagta 


cattccatcc 


agaatgatta tgatgaagac 


1620 
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gattactcat 


actattaccg 


gccgcgtaga ccaattccac aaggtaaaga tctttattac 


1680 


aaaaattacc 


gttattacgc 


attaaaatcc ggagggcggc tgaatgcacc tatgccggaa 


1740 


aatggcgtgg 


ccgaaaacaa 


tgactggatt tttatgggtt atactcaaga agaggctcgc 


1800 


aaaaatgcaa 


tgaaccataa 


aaataaccga aggatcggtg atttcggcgg atttttcgat 


1860 


gaggaaaatg 


gtaaaggtca 


caatggtgcg ctgaatctaa attttaacgg caaaagtgcc 


1920 


cagaaacgtt 


tccttctgac 


tggtggcgct aatctgaatg gtaaaatcag tgtgacgcag 


1980 


ggtaacgtgc 


tgctttctgg 


ccggccaact ccgcatgcac gtgattttgt aaataaatcg 


2040 


agcgctcgta 


aagatgcgca 


tttttctaaa aataacgagg tcgtgtttga agatgactgg 


2100 


ataaatcgca 


cctttaaagc 


ggcagaaatc gcggttaatc agagtgcgag cttttcatcg 


2160 


ggtaggaatg 


tatctgatat 


tacagcaaac attacagcca ctgataatgc gaaggtcaac 


2220 


ctgggttata 


aaaacggtga 


tgaagtttgt gttcgatcgg attacacggg ctatgttacc 


2280 


tgcaacactg 


gcaatctgtc 


tgataaagcg cttaactctt ttgacgccac gcgcattaac 


2340 


gggaatgtga 


acctgaacca 


aaacgctgcc ttggtacttg gtaaggccgc gttgtggggt 


2400 


aaaattcagg 


gccagggcaa 


ctcccgtgtg tctctgaacc agcactcgaa gtggcacctg 


2460 


acgggggact 


cgcaggtgca 


caacttgtcc ctggccgata gccatattca ccttaacaat 


2520 


gcgtccgatg 


cccagtcagc 


taataaatat catacgatca aaatcaatca cctctctggc 


2580 


aacggtcact 


ttcactactt 


aacggattta gcaaaaaact taggggataa agtcctggta 


2640 


aaagaatcag 


cgagcggaca 


ttatcagtta catgtacaga acaaaacagg cgagccaaat 


2700 


caggaaggcc 


ttgacttatt 


tgatgcttca tcggtacaag atcgttccag actgttcgtt 


2760 


tcactcgcga 


atcactacgt 


tgatctgggt gcgctgcgct atactataaa gacggaaaat 


2820 


ggcataacac 


gcctctataa 


tccctatgcc ggtaacggcc gtccggtgaa acctgctccc 


2880 


tgcgtcgac 






2889 



<210> 87 
<211> 4296 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 87 

ggatccttgg tacgagatga cgttgactat caaattttcc gcgactttgc ggaaaataaa 60 
ggtaagtttt tcgtcggcgc cacagacctg tccgtcaaaa ataagagagg ccagaacatc 120 
ggtaacgcac tgagcaacgt ccctatgatt gattttagtg tagcggacgt taataaacgg 180 
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attgcaaccg 


tcgttgatcc 


gcagtatgct 


gtcagcgtca 


aacatgctaa 


agcggaagtt 


240 


catacgttct 


attacgggca 


atataacggc 


cataacgatg 


tggctgataa 


agaaaatgaa 


300 


tatcgcgtgg 


tcgagcagaa 


caattacgaa 


ccgcacaaag 


cgtggggcgc 


gagtaattta 


360 


ggccgcctgg 


aggactataa 


catggcccgt 


ttcaataaat 


tcgtgaccga 


ggtagcaccg 


420 


atcgccccca 


cagatgctgg 


tgggggcctg 


gatacctaca 


aagataaaaa 


ccgcttctct 


480 


agcttcgtgc 


gcattggcgc 


cggtcgtcag 


ctcgtgtacg 


agaagggtgt 


ctatcaccag 


540 


gaaggtaatg 


aaaaggggta 


cgacctccgt 


gatttgtccc 


aggcgtatcg 


ctacgctatt 


600 


gccggaaccc 


cgtataaaga 


tattaatatc 


gatcaaacca 


tgaataccga 


aggcctaatt 


660 


ggtttcggga 


atcataataa 


gcaatatagc 


gcagaagagc 


taaagcaggc 


cctcagccaa 


720 


gatgcgttaa 


ccaattacgg 


agtgttaggc 


gatagcggca 


gtccgctgtt 


tgccttcgat 


780 


aaacagaaaa 


atcaatgggt 


gtttctgggc 


acttatgatt 


attgggccgg 


atatggtaaa 


840 


aagagctggc 


aggaatggaa 


tatttataaa 


aaggaattcg 


cagacaaaat 


caagcagcat 


900 


gacaacgcag 


gtacggtgaa 


ggggaacggc 


gaacatcact 


ggaagacgac 


cggcacgaat 


960 


agtcatatcg 


gatcgacggc 


cgttcgcctg 


gcgaacaatg 


agggcgatgc 


aaacaatggg 


1020 


caaaacgtga 


cctttgagga 


caacggtacc 


ctggtcctta 


accagaacat 


aaatcagggc 


1080 


gcgggaggct 


tgttctttaa 


aggcgactat 


actgttaagg 


gagcaaacaa 


tgacatcacc 


1140 


tggttagggg 


ccggtattga 


cgttgcggat 


ggaaaaaagg 


tggtttggca 


ggttaaaaac 


1200 


cctaacgggg 


accggctggc 


aaaaatcggc 


aaagggacat 


tggaaattaa 


tggtaccggt 


1260 


gtgaatcagg 


gtcagctgaa 


agtgggagat 


gggaccgtga 


ttctgaacca 


gaaagcagac 


1320 


gctgacaaaa 


aggtgcaagc 


ctttagccaa 


gtaggaattg 


ttagtggtcg 


tggcacactc 


1380 


gtcttgaact 


caagcaacca 


aataaatccg 


gataacctgt 


actttggatt 


tcgtggcgga 


1440 


cgcctggatg 


ctaacgggaa 


tgatctgacc 


tttgaacata 


tccgtaacgt 


tgacgagggt 


1500 


gcgcgcatag 


ttaatcataa 


tactgaccat 


gcatcaacta 


tcaccttgac 


cgggaaaagt 


1560 


ctgattacaa 


acccaaactc 


tctgtcagta 


cattccatcc 


agaatgatta 


tgatgaagac 


1620 


gattactcat 


a&tattaccg 


gccgcgtaga 


ccaattccac 


aaggtaaaga 


tctttattac 


1680 


aaaaattacc 


gttattacgc 


attaaaatcc 


ggagggcggc 


tgaatgcacc 


tatgccggaa 


1740 


aatggcgtgg 


ccgaaaacaa 


tgactggatt 


tttatgggtt 


atactcaaga 


agaggctcgc 


1800 


aaaaatgcaa 


tgaaccataa 


aaataaccga 


aggatcggtg 


atttcggcgg 


atttttcgat 


1860 


gaggaaaatg 


gtaaaggtca 


caatggtgcg 


ctgaatctaa 


attttaacgg 


caaaagtgcc 


1920 


cagaaacgtt 
ggtaacgtgc 


tccttctgac 


tggtggcgct 


aatctgaatg 


gtaaaatcag 


tgtgacgcag 


1980 


tgctttctgg 


ccggccaact 


ccgcatgcac 


gtgattttgt 


aaataaatcg 


2040 
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agcgctcgta 


aagatgcgca 


tttttctaaa 


aataacgagg 


tcgtgtttga 


agatgactgg 


2100 


ataaatcgca 


cctttaaagc 


ggcagaaatc 


gcggttaatc 


agagtgcgag 


cttttcatcg 


2160 


ggtaggaatg 


tatctgatat 


tacagcaaac 


attacagcca 


ctgataatgc 


gaaggtcaac 


2220 


ctgggttata 


aaaacggtga 


tgaagtttgt 


gttcgatcgg 


attacacggg 


ctatgttacc 


2280 


tgcaacactg 


gcaatctgtc 


tgataaagcg 


cttaactctt 


ttgacgccac 


gcgcattaac 


2340 


gggaatgtga 


acctgaacca 


aaacgctgcc 


ttggtacttg 


gtaaggccgc 


gttgtggggt 


2400 


aaaattcagg 


gccagggcaa 


ctcccgtgtg 


tctctgaacc 


agcactcgaa 


gtggcacctg 


2460 


acgggggact 


cgcaggtgca 


caacttgtcc 


ctggccgata 


gccatattca 


ccttaacaat 


2520 


gcgtccgatg 


cccagtcagc 


taataaatat 


catacgatca 


aaatcaatca 


cctctctggc 


2580 


aacggtcact 


ttcactactt 


aacggattta 


gcaaaaaact 


taggggataa 


agtcctggta 


2640 


aaagaatcag 


cgagcggaca 


ttatcagtta 


catgtacaga 


acaaaacagg 


cgagccaaat 


2700 


caggaaggcc 


ttgacttatt 


tgatgcttca 


tcggtacaag 


atcgttccag 


actgttcgtt 


2760 


tcactcgcga 


atcactacgt 


tgatctgggt 


gcgctgcgct 


atactataaa 


gacggaaaat 


2820 


ggcataacac 


gcctctataa 


tccctatgcc 


ggtaacggcc 


gtccggtgaa 


acctgctccc 


2880 


tgcgtcgacg 


gcatcattac 


ctccaaaact 


aaatctctga 


tagaaggtag 


atttggcggt 


2940 


ttcacgggcg 


cacgcaaatc 


agcgcgtaaa 


cgtaagaacc 


aggcgctagc 


gggcggtggc 


3000 


ggtagcggcg 


gtggcggtag 


cggcggtggc 


ggtagcgcac 


tagtgctgca 


gtgtatcaag 


3060 


gttaacaact 


gggatttatt 


cttcagcccg 


agtgaagaca 


acttcaccaa 


cgacctgaac 


3120 


aaaggtgaag 


aaatca'cctc 


agatactaac 


atcgaagcag 


ccgaagaaaa 


catctcgctg 


3180 


gacctgatcc 


agcagtacta 


cctgaccttt 


aatttcgaca 


acgagccgga 


aaacatttct 


3240 


atcgaaaacc 


tgagctctga 


tatcatcggc 


cagctggaac 


tgatgccgaa 


catcgaacgt 


3300 


ttcccaaacg 


gtaaaaagta 


cgagctggac 


aaatatacca 


tgttccacta 


cctgcgcgcg 


3360 


caggaatttg 


aacacggcaa 


atcccgtatc 


gcactgacta 


actccgttaa 


cgaagctctg 


3420 


ctcaacccgt 


cccgtgtata 


caccttcttc 


tctagcgact 


acgtgaaaaa 


ggtcaacaaa 


3480 


gcgactgaag 


ctgcaatgtt 


cttgggttgg 


gttgaacagc 


ttgtttatga 


ttttaccgac 


3540 


gagacgtccg 


aagtatctac 


taccgacaaa 


attgcggata 


tcactatcat 


catcccgtac 


3600 


atcggtccgg 


ctctgaacat 


tggcaacatg 


ctgtacaaag 


acgacttcgt 


tggcgcactg 


3660 


atcttctccg 


gtgcggtgat 


cctgctggag 


ttcatcccgg 


aaatcgccat 


cccggtactg 


3720 


ggcacctttg 


ctctggtttc 


ttacattgca 


aacaaggttc 


tgactgtaca 


aaccatcgac 


3780 


aacgcgctga 


gcaaacgtaa. 


cgaaaaatgg 


gatgaagttt 


acaaatatat 


cgtgaccaac 


3840 
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tggctggcta 


aggttaatac 


tcagatcgac 


ctcatccgca aaaaaatgaa agaagcactg 


3900 


gaaaaccagg 


cggaagctac 


caaggcaatc 


attaactacc agtacaacca gtacaccgag 


3960 


gaagaaaaaa 


acaacatcaa 


cttcaacatc 


gacgatctgt cctctaaact gaacgaatcc 


4020 


atcaacaaag 


ctatgatcaa 


catcaacaag 


ttcctgaacc agtgctctgt aagctatctg 


4080 


atgaactcca 


tgatcccgta 


cggtgttaaa 


cgtctggagg acttcgatgc gtctctgaaa 


4140 


gacgccctgc 


tgaaatacat 


ttacgacaac 


cgtggcactc tgatcggtca ggttgatcgt 


4200 


ctgaaggaca 


aagtgaacaa 


taccttatcg 


accgacatcc cttttcagct cagtaaatat 


4260 


gtcgataacc 


aacgcctttt 


gtccactcta 


gactag 


4296 



<210> 88 
<211> 1431 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 88 

Gly Ser Leu Val Arg Asp Asp Val Asp Tyr Gin He Phe Arg Asp Phe 
15 10 15 



Ala Glu Asn Lys Gly Lys Phe Phe Val Gly Ala Thr Asp Leu Ser Val 
20 25 30 



Lys Asn Lys Arg Gly Gin Asn He Gly Asn Ala Leu Ser Asn Val Pro 
35 40 45 



Met He Asp Phe Ser Val Ala Asp Val Asn Lys Arg He Ala Thr Val 
50 55 60 



Val Asp Pro Gin Tyr Ala Val Ser Val Lys His Ala Lys Ala Glu Val 
65 70 75 80 



His Thr Phe Tyr Tyr Gly Gin Tyr Asn Gly His Asn Asp Val Ala Asp 
85 90 95 



Lys Glu Asn Glu Tyr Arg Val Val Glu Gin Asn Asn Tyr Glu Pro His 
100 105 110 



Lys Ala Trp Gly Ala Ser Asn Leu Gly Arg Leu Glu Asp Tyr Asn Met 
115 120 -125 



Ala jArg. Phe. AsE ,Lys Bhe iVai: Thrc^lii' ValuAlaitPro^Ile Ala. Pre Thr 
130 135 140 



wo 2006/059105 



163 



PCT/GB2005/004598 



Asp Ala Gly Gly Gly Leu Asp Thr Tyr Lys Asp Lys Asn Arg Phe Ser 
145 150 155 160 



Ser Phe Val Arg lie Gly Ala Gly Arg Gin Leu Val Tyr Glu Lys Gly 
165 170 175 



Val Tyr His Gin Glu Gly Asn Glu Lys Gly Tyr Asp Leu Arg Asp Leu 
180 185 190 



Ser Gin Ala Tyr Arg Tyr Ala lie Ala Gly Thr Pro Tyr Lys Asp lie 
195 200 205 



Asn He Asp Gin Thr Met Asn Thr Glu Gly Leu He Gly Phe Gly Asn 
210 215 220 



His Asn Lys Gin Tyr Ser Ala Glu Glu Leu Lys Gin Ala Leu Ser Gin 
225 230 235 240 



Asp Ala Leu Thr Asn Tyr Gly Val Leu Gly Asp Ser Gly Ser Pro Leu 
245 250 255 



Phe Ala Phe Asp Lys Gin Lys Asn Gin Trp Val Phe Leu Gly Thr Tyr 
260 265 270 



Asp Tyr Trp Ala Gly Tyr Gly Lys Lys Ser Trp Gin Glu Trp Asn He 
275 280 285 



Tyr Lys Lys Glu Phe Ala Asp Lys He Lys Gin His Asp Asn Ala Gly 
290 295 300 



Thr Val Lys Gly Asn Gly Glu His His Trp Lys Thr Thr Gly Thr Asn 
305 310 315 320 



Ser His He Gly Ser Thr Ala Val Arg Leu Ala Asn Asn Glu Gly Asp 
325 330 335 



Ala Asn Asn Gly Gin Asn Val Thr Phe Glu Asp Asn Gly Thr Leu Val 
340 345 350 



Leu Asn Gin Asn He Asn Gin Gly Ala Gly Gly Leu Phe Phe Lys Gly 
355 360 365 



Asp Tyr Thr Val Lys Gly Ala Asn Asn Asp He Thr Trp Leu Gly Ala 
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Gly lie Asp Val Ala Asp Gly Lys Lys Val Val Trp Gin Val Lys Asn 
385 390 395 400 



Pro Asn Gly Asp Arg Leu Ala Lys lie Gly Lys Gly Thr Leu Glu lie 
405 410 415 



Asn Gly Thr Gly Val Asn Gin Gly Gin Leu Lys Val Gly Asp Gly Thr 
420 425 430 



Val lie Leu Asn Gin Lys Ala Asp Ala Asp Lys Lys Val Gin Ala Phe 
435 440 445 



Ser Gin Val Gly lie Val Ser Gly Arg Gly Thr Leu Val Leu Asn Ser 
450 455 460 



Ser Asn Gin lie Asn Pro Asp Asn Leu Tyr Phe Gly Phe Arg Gly Gly 
465 470 475 480 



Arg Leu Asp Ala Asn Gly Asn Asp Leu Thr Phe Glu His lie Arg Asn 
485 490 495 



Val Asp Glu Gly Ala Arg lie Val Asn His Asn Thr Asp His Ala Ser 
500 505 510 



Thr lie Thr Leu Thr Gly Lys Ser Leu lie Thr Asn Pro Asn Ser Leu 
515 520 525 



Ser Val His Ser He Gin Asn Asp Tyr Asp Glu Asp Asp Tyr Ser Tyr 
530 535 540 



Tyr Tyr Arg Pro Arg Arg Pro He Pro Gin Gly Lys Asp Leu Tyr Tyr 
545 550 555 560 



Lys Asn Tyr Arg Tyr Tyr Ala Leu Lys Ser Gly Gly Arg Leu Asn Ala 
565 570 575 



Pro Met Pro Glu Asn Gly Val Ala Glu Asn Asn Asp Trp He Phe Met 
580 585 590 



Gly Tyr Thr Gin Glu Glu Ala Arg Lys Asn Ala Met Asn His Lys Asn 
595 600 605 



Asn Arg Arg He Gly Asp Phe Gly Gly Phe Phe Asp Glu Glu Asn Gly 
610 615 620 
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Lys Gly His Asn Gly Ala Leu Asn Leu Asn Phe Asn Gly Lys Ser Ala 
625 630 635 640 



Gin Lys Arg Phe Leu Leu Thr Gly Gly Ala Asn Leu Asn Gly Lys lie 
645 650 655 



Ser Val Thr Gin Gly Asn Val Leu Leu Ser Gly Arg Pro Thr Pro His 
660 665 670 



Ala Arg Asp Phe Val Asn Lys Ser Ser Ala Arg Lys Asp Ala His Phe 
675 680 685 



Ser Lys Asn Asn Glu Val Val Phe Glu Asp Asp Trp lie Asn Arg Thr 
690 695 700 



Phe Lys Ala Ala Glu lie Ala Val Asn Gin Ser Ala Ser Phe Ser Ser 
705 710 715 720 



Gly Arg Asn Val Ser Asp lie Thr Ala Asn lie Thr Ala Thr Asp Asn 
725 730 735 



Ala Lys Val Asn Leu Gly Tyr Lys Asn Gly Asp Glu Val Cys Val Arg 
740 745 750 



Ser Asp Tyr Thr Gly Tyr Val Thr Cys Asn Thr Gly Asn Leu Ser Asp 
755 760 765 



Lys Ala Leu Asn Ser Phe Asp Ala Thr Arg lie Asn Gly Asn Val Asn 
770 775 780 



Leu Asn Gin Asn Ala Ala Leu Val Leu Gly Lys Ala Ala Leu Trp Gly 
785 790 795 800 



Lys lie Gin Gly Gin Gly Asn Ser Arg Val Ser Leu Asn Gin His Ser 
805 810 815 



Lys. Trp His Leu Thr Gly Asp Ser Gin Val His Asn Leu Ser Leu Ala 
820 825 830 



Asp Ser His He His Leu Asn Asn Ala Ser Asp Ala Gin Ser Ala Asn 
835 840 845 



Lys Tyr His Thr He Lys He Asn His Leu Ser Gly Asn Gly His Phe 
850 855 860 



His Tyr Leu Thr Asp Leu Ala Lys Asn Leu Gly Asp Lys Val Leu Val 
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865 870 875 880 



Lys Glu Ser Ala Ser Gly His Tyr Gin Leu His Val Gin Asn Lys Thr 
885 890 895 



Gly Glu Pro Asn Gin Glu Gly Leu Asp Leu Phe Asp Ala Ser Ser Val 
900 90S 910 



Gin Asp Arg Ser Arg Leu Phe Val Ser Leu Ala Asn His Tyr Val Asp 
915 920 925 



Leu Gly Ala Leu Arg Tyr Thr lie Lys Thr Glu Asn Gly He Thr Arg 
930 935 940 



Leu Tyr Asn Pro Tyr Ala Gly Asn Gly Arg Pro Val Lys Pro Ala Pro 
945 950 955 960 



Cys Val Asp Gly lie He Thr Ser Lys Thr Lys Ser Leu He Glu Gly 
965 970 975 



Arg Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Arg Lys 
980 985 990 



Asn Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 
995 1000 1005 



Gly Gly Gly Ser Ala Leu Val Leu Gin Cys He Lys Val Asn Asn 
1010 1015 1020 



Trp Asp Leu Phe Phe Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp 
1025 1030 1035 



Leu Asn Lys Gly Glu Glu He Thr Ser Asp Thr Asn He Glu Ala 
1040 1045 1050 



Ala Glu Glu Asn He Ser Leu Asp Leu He Gin Gin Tyr Tyr Leu 
1055 1060 1065 



Thr Phe Asn Phe Asp Asn Glu Pro Glu Asn He Ser He Glu Asn 
1070 1075 1080 



Leu Ser Ser- Asp He He Gly Gin Leu Glu Leu Met Pro Asn He 
1085 1090 1095 



. Glu Arg Phe Pro Asn. Gly Lys Lys Tyr Glu Leu Asp Lys Tyr Thr* 
1100 1105 1110 
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Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu His Gly Lys Ser 
1115 1120 1125 



Arg lie Ala Leu Thr Asn Ser Val Asn Glu Ala Leu Leu Asn Pro 
1130 1135 1140 



Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys Lys Val 
1145 1150 1155 



Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu Gin 
1160 1165 1170 



Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr 
1175 1180 1185 



Asp Lys lie Ala Asp lie Thr He He He Pro Tyr He Gly Pro 
1190 1195 1200 



Ala Leu Asn He Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly 
1205 1210 1215 



Ala Leu He Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro 
1220 1225 1230 



Glu He Ala He Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr 
1235 1240 1245 



He Ala Asn Lys Val Leu Thr Val Gin Thr He Asp Asn Ala Leu 
1250 1255 1260 



Ser Lys Arg Asn Glu Lys Trp Asp Glu. Val Tyr Lys Tyr He Val 
1265 1270 1275 



Thr Asn Trp Leu Ala Lys Val Asn Thr Gin He Asp Leu He Arg 
1280 1285 1290 



Lys Lys Met Lys Glu Ala Leu Glu Asn Gin Ala Glu Ala Thr Lys 
1295 1300 1305 



Ala He He Asn Tyr Gin Tyr Asn Gin Tyr Thr Glu Glu Glu Lys 
1310 1315 1320 



Asn Asn He Asn Phe Asn He Asp Asp Leu Ser Ser Lys Leu Asn 
. > -ia25 . v. .1330 -u .r:.: • .J iU^S 
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Glu Ser lie Asn Lys Ala Met lie Asn lie Asn Lys Phe Leu Asn 
1340 1345 X350 



Gin Cys Ser Val Ser Tyr Leu Met Asn Ser Met lie Pro Tyr Gly 
1355 1360 1365 



Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys Asp Ala Leu 
1370 1375 1380 



Leu Lys Tyr lie Tyr Asp Asn Arg Gly Thr Leu lie Gly Gin Val 
1385 1390 1395 



Asp Arg Leu Lys Asp Lys Val Asn Asn Thx Leu Ser Thr Asp lie 
1400 1405 1410 



Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu Leu Ser 
1415 1420 1425 



Thr Leu Asp 
1430 



<210> 89 
<211> 1357 
<212> DNA 

<213> Artificial Sequence 










<220> 

<223> Synthetic 












<400> 89 
gctagcgggc 


ggtggcggta 


gcggcggtgg 


cggtagcggc 


ggtggcggta 


gcgcactagt 


60 


gctgcagtgt 


atcaaggtta 


acaactggga 


tttattcttc 


agcccgagtg 


aagacaactt 


120 


caccaacgac 


ctgaacaaag 


gtgaagaaat 


cacctcagat 


actaacatcg 


aagcagccga 


180 


agaaaacatc 


tcgctggacc 


tgatccagca 


gtactacctg 


acctttaatt 


tcgacaacga 


240 ' 


gccggaaaac 


atttctatcg 


aaaacctgag 


ctctgatatc 


atcggccagc 


tggaactgat 


300 


gccgaacatc 


gaacgtttcc 


caaacggtaa 


aaagtacgag 


ctggacaaat 


ataccatgtt 


360 


ccactacctg 


cgcgcgcagg 


aatttgaaca 


cggcaaatcc 


cgtatcgcac 


tgactaactc 


420 


cgttaacgaa 


gctctgctca 


acccgtcccg 


tgtatacacc 


ttcttctcta 


gcgactacgt 


480 


gaaaaaggtc 


aacaaagcga 


ctgaagctgc 


aatgttcttg 


ggttgggttg 


aacagcttgt 


540 


ttatgatttt 


accgacgaga 


cgtccgaagt 


atctactacc 


gacaaaattg 


cggatatcac 


600 


tatcatcatc 


ccgtacatcg 


gtccggctct 


gaacattggc 

''.it 


aacatgctgt 


acaaagacga 


660 


cttcgttggc 


gcactgatct 


tctccggtgc 


ggtgatcctg 


ctggagttca 


tcccggaaat 


720 
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cgccatcccg 


gtactgggca 


cctttgctct 


ggtttcttac attgcaaaca aggttctgac 


780 


tgtacaaacc 


atcgacaacg 


cgctgagcaa 


acgtaacgaa aaatgggatg aagtttacaa 


840 


atatatcgtg 


accaactggc 


tggctaaggt 


taatactcag atcgacctca tccgcaaaaa 


900 


aatgaaagaa 


gcactggaaa 


accaggcgga 


agctaccaag gcaatcatta actaccagta 


960 


caaccagtac 


accgaggaag 


aaaaaaacaa 


catcaacttc aacatcgacg atctgtcctc 


1020 


taaactgaac 


gaatccatca 


acaaagctat 


gatcaacatc aacaagttcc tgaaccagtg 


1080 


ctctgtaagc 


tatctgatga 


actccatgat 


cccgtacggt gttaaacgtc tggaggactt 


1140 


cgatgcgtct 


ctgaaagacg 


ccctgctgaa 


atacatttac gacaaccgtg gcactctgat 


1200 


cggtcaggtt 


gatcgtctga 


aggacaaagt 


gaacaatacc ttatcgaccg acatcccttt 


1260 


tcagctcagt 


aaatatgtcg 


ataaccaacg 


ccttttgtcc actctagaaa tagaaggtag 


1320 


aagtgggcac 


catcaccatc 


accattaatg 


aaagctt 


1357 



<210> 90 

<211> 2745 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 90 



ggatccatgg 


agttcgttaa 


caaacagttc 


aactataaag acccagttaa cggtgttgac 


60 


attgcttaca 


tcaaaatccc 


gaacgctggc 


cagatgcagc cggtaaaggc attcaaaatc 


120 


cacaacaaaa 


tctgggttat 


cccggaacgt 


gataccttta ctaacccgga agaaggtgac 


180 


ctgaacccgc 


caccggaagc 


gaaacaggtg 


ccggtatctt actatgactc cacctacctg 


240 


tctaccgata 


acgaaaagga 


caactacctg 


aaaggtgtta ctaaactgtt cgagcgtatt 


300 


tactccaccg 


acctgggccg 


tatgctgctg 


actagcatcg ttcgcggtat cccgttctgg 


360 


99cggttcta 


ccatcgatac 


cgaactgaaa 


gtaatcgaca ctaactgcat caacgttatt 


420 


cagccggacg 


gttcctatcg 


ttccgaagaa 


ctgaacctgg tgatcatcgg cccgtctgct 


480 


gatatcatcc 


agttcgagtg 


taagagcttt 


ggtcacgaag ttctgaacct cacccgtaac 


540 


ggctacggtt 


ccactcagta 


catccgtttc 


tctccggact tcaccttcgg ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


cccactgctg 


ggcgctggta aattcgcaac tgatcctgcg 


660 


gttaccctgg 


ctcacgaact 


gattcatgca 


ggccaccgcc tgtacggtat cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt 


taacaccaac 


gcgtattacg agatgtccgg tctggaagtt 


780 


agcttcgaag 


aactgcgtac- 


'ttttggcggt 


cacgacgcta aattcatcga ctctctgcaa 


840 
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gaaaacgagt tccgtctgta 


ctactataac 


aagttcaaag 


atatcgcatc 


caccctgaac 


900 


aaagcgaaat ccatcgtggg 


taccactgct 


tctctccagt 


acatgaagaa 


cgtttttaaa 


960 


gaaaaatacc tgctcagcga 


agacacctcc 


ggcaaattct 


ctgtagacaa 


gttgaaattc 


1020 


gataaacttt acaaaatgct 


gactgaaatt 


tacaccgaag 


acaacttcgt 


taagttcttt 


1080 


aaagttctga accgcaaaac 


ctatctgaac 


ttcgacaagg 


cagtattcaa 


aatcaacatc 


1140 


gtgccgaaag ttaactacac 


tatctacgat 


ggtttcaacc 


tgcgtaacac 


caacctggct 


1200 


gctaatttta acggccagaa 


cacggaaatc 


aacaacatga 


acttcacaaa 


actgaaaaac 


1260 


ttcactggtc tgttcgagtt 


ttacaagctg 


ctgtgcgtcg 


acggcatcat 


tacctccaaa 


1320 


actaaatctc tgatagaagg 


tagatttggc 


ggtttcacgg 


gcgcacgcaa 


atcagcgcgt 


1380 


aaacgtaaga accaggcgct 


agcgggcggt 


ggcggtagcg 


gcggtggcgg 


tagcggcggt 


1440 


ggcggtagcg cactagtgct 


gcagtgtatc 


aaggttaaca 


actgggattt 


attcttcagc 


1500 


ccgagtgaag acaacttcac 


caacgacctg 


aacaaaggtg 


aagaaatcac 


ctcagatact 


1560 


aacatcgaag cagccgaaga 


aaacatctcg 


ctggacctga 


tccagcagta 


ctacctgacc 


1620 


tttaatttcg acaacgagcc 


ggaaaacatt 


tctatcgaaa 


acctgagctc 


tgatatcatc 


1680 


ggccagctgg aactgatgcc 


gaacatcgaa 


cgtttcccaa 


acggtaaaaa 


gtacgagctg 


1740 


gacaaatata ccatgttcca 


ctacctgcgc 


gcgcaggaat 


ttgaacacgg 


caaatcccgt 


1800 


atcgcactga ctaactccgt 


taacgaagct 


ctgctcaacc 


cgtcccgtgt 


atacaccttc 


1860 


ttctctagcg actacgtgaa 


aaaggtcaac 


aaagcgactg 


aagctgcaat 


gttcttgggt 


1920 


tgggttgaac agcttgttta 


tgattttacc 


gacgagacgt 


ccgaagtatc 


tactaccgac 


1980 


aaaattgcgg atatcactat 


catcatcccg 


tacatcggtc 


cggctctgaa 


cattggcaac 


2040 


atgctgtaca aagacgactt 


cgttggcgca 


ctgatcttct 


ccggtgcggt 


gatcctgctg 


2100 


gag t teat cc cggaaatcgc 


catcccggta 


ctgggcacct 


ttgctctggt 


ttcttacatt 


2160 


gcaaacaagg ttctgactgt 


acaaaccatc 


gacaacgcgc 


tgagcaaacg 


taacgaaaaa 


2220 


tgggatgaag tttacaaata 


tatcgtgacc 


aactggctgg 


ctaaggttaa 


tactcagatc 


2280 


gacctcatcc g'caaaaaaat 


gaaagaagca 


ctggaaaacc 


aggcggaagc 


taccaaggca 


2340 


atcattaact accagtacaa 


ccagtacacc 


gaggaagaaa 


aaaacaacat 


caacttcaac 


2400 


atcgacgatc tgtcctctaa 


actgaacgaa 


tccatcaaca 


aagctatgat 


caacatcaac 


2460 


aagttcctga accagtgctc 


tgtaagctat 


ctgatgaact 


ccatgatccc 


gtacggtgtt 


2520 


aaacgtctgg aggacttcga 


tgcgtctctg 


aaagacgccc 


tgctgaaata 


catttacgac 


2580 


aaccgtggca ctctgatcgg 


tcaggttgat 


cgtctgaagg 


acaaagtgaa 


caatacctta 


2640 


tcgaccgaca tcccttttca 


gctcagtaaa 


tatgtcgata accaacgcct tttgtccact 


2700 
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ctagaaatag aaggtagaag tgggcaccat caccatcacc attaa 2745 



<210> 91 
<211> 914 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 91 

Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
15 10 15 



Asn Gly Val Asp He Ala Tyr He Lys He Pro Asn Ala Gly Gin Met 
20 25 30 



Gin Pro Val Lys Ala Phe Lys He His Asn Lys He Trp Val He Pro 
35 40 45 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg He Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



He Val Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
»:u ,.180 ... o - .. 185 190 , r 
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Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu He Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys He Asn He Val Pro Lys Val 
370 375 380 



Asn Tyr Thr He Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 
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Val Asp Gly He He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Arg Lys Asn 
450 455 460 



Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
465 470 475 480 



Gly Gly Ser Ala Leu Val Leu Gin Cys He Lys Val Asn Asn Trp Asp 
485 490 495 



Leu Phe Phe Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys 
500 505 510 



Gly Glu Glu He Thr Ser Asp Thr Asn He Glu Ala Ala Glu Glu Asn 
515 520 525 



He Ser Leu Asp Leu He Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp 
530 535 540 



Asn Glu Pro Glu Asn He Ser He Glu Asn Leu Ser Ser Asp He He 
545 550 555 560 



Gly Gin Leu Glu Leu Met Pro Asn He Glu Arg Phe Pro Asn Gly Lys 
565 570 575 



Lys Tyr Glu Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin 
580 585 590 



Glu Phe Glu His Gly Lys Ser Arg He Ala Leu Thr Asn Ser Val Asn 
595 600 605 



Glu Ala Leu Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp 
610 615 620 



Tyr Val Lys Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly 
625 630 635 • 640 



Trp Val Glu Gin Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val 
645 650 655 



Ser Thr Thr Asp Lys He Ala Asp He Thr He He He Pro Tyr He 
660 665 670 



Gly Pro Ala Leu Asn He Gly Asn Met Leu Tyr Lys Asp Asp Phe Val 
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675 680 685 



Gly Ala Leu He Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro 
690 695 700 



Glu He Ala He Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He 
705 710 715 720 



Ala Asn Lys Val Leu Thr Val Gin Thr He Asp Asn Ala Leu Ser Lys 
725 730 735 



Arg Asa Glu Lys Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp 
740 745 750 



Leu Ala Lys Val Asn Thr Gin He Asp Leu He Arg Lys Lys Met Lys 
755 760 765 



Glu Ala Leu Glu Asn Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr 
770 775 780 



Gin Tyr Asn Gin Tyr Thr Glu Glu Glu Lys Asn Asn He Asn Phe Asn 
785 790 795 800 



He Asp Asp Leu Ser Ser Lys Leu Asn Glu Ser He Asn Lys Ala Met 
805 810 815 



He Asn He Asn Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met 
820 825 830 



Asn Ser Met He Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala 
835 840 845 



Ser Leu Lys Asp Ala Leu Leu Lys Tyr He Tyr Asp Asn Arg Gly Thr 
850 855 860 



Leu He Gly Gin Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu 
865 870 875 880 



Ser Thr Asp He Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg 
885 890 895 



Leu Leu Ser Thr Leu Glu He Glu Gly Arg Ser Gly His His His His 
900 905 910 



■His His 
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<210> 92 

<211> 619 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 92 



gctagcgggc 


ggtggcggta 


gcggcggtgg cggtagcggc ggtggcggta gcgcactagt 


60 


gctgcagtgt 


atcaatctgg 


attgggacgt aatccgtgat aagaccaaaa caaaaatcga 


120 


gtctttgaaa 


gaacacggcc 


cgatcaaaaa taagatgtct gaatcaccca ataaaactgt 


180 


ttcggaggaa 


aaagcgaaac 


agtatttgga agagtttcat caaaccgcgc ttgaacatcc 


240 


ggagctcagt 


gaactgaaaa 


cagtgacggg aacgaatcct gtttttgcag gcgcaaacta 


300 


tgcggcttgg 


gccgtgaatg 


ttgcccaagt aattgatagt gagaccgcag acaacctgga 


360 


aaagacgacc 


gcagcgttaa 


gcattttacc ggggattggt tccgtgatgg gtatagcgga 


420 


tggagcggtc 


caccataaca 


ctgaggaaat tgtcgcccag tcaatcgctc tgagttccct 


480 


gatggttgca 


caggctatcc 


cactcgtggg ggaactggtt gacataggtt tcgccgccta 


540 


caacttcgta 


gaaagcatta 


ttaatctttt tcaggtggtg cataacagct acaaccgccc 


600 


tctagaatga 


taaaagctt 




619 



<210> 93 
<211>. 1971 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 93 

ggatccatgg agttcgttaa caaacagttc aactataaag acccagttaa cggtgttgac 60 
attgcttaca tcaaaatccc gaacgctggc cagatgcagc cggtaaaggc attcaaaatc 120 
cacaacaaaa tctgggttat cccggaacgt gataccttta ctaacccgga agaaggtgac 180 
ctgaacccgc caccggaagc gaaacaggtg ccggtatctt actatgactc cacctacctg 240 
tctaccgata acgaaaagga caactacctg aaaggtgtta ctaaactgtt cgagcgtatt 300 
tactccaccg acctgggccg tatgctgctg actagcatcg ttcgcggtat cccgttctgg 360 
ggcggttcta ccatcgatac cgaactgaaa gtaatcgaca ctaactgcat caacgttatt 420 
cagccggacg gttcctatcg ttccgaagaa ctgaacctgg tgatcatcgg cccgtctgct 480 
gatatcatcc agttcgagtg taagagcttt ggtcacgaag ttctgaacct cacccgtaac 540 
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ggctacggtt 


ccactcagta 


catccgtttc 


tctccggact 


tcaccttcgg 


ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


cccactgctg 


ggcgctggta 


aattcgcaac 


tgatcctgcg 


660 


gttaccctgg 


ctcacgaact 


gattcatgca 


ggccaccgcc 


tgtacggtat 


cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt 


taacaccaac 


gcgtattacg 


agatgtccgg 


tctggaagtt 


780 


agcttcgaag 


aactgcgtac 


ttttggcggt 


cacgacgcta 


aattcatcga 


ctctctgcaa 


840 


gaaaacgagt 


tccgtctgta 


ctactataac 


aagttcaaag 


atatcgcatc 


caccctgaac 


900 


aaagcgaaat 


ccatcgtggg 


taccactgct 


tctctccagt 


acatgaagaa 


cgtttttaaa 


960 


gaaaaatacc 


tgctcagcga 


agacacctcc 


ggcaaattct 


ctgtagacaa 


gttgaaattc 


1020 


gataaacttt 


acaaaatgct 


gactgaaatt 


tacaccgaag 


acaacttcgt 


taagttcttt 


1080 


aaagttctga 


accgcaaaac 


ctatctgaac 


ttcgacaagg 


cagtattcaa 


aatcaacatc 


1140 


gtgccgaaag 


ttaactacac 


tatctacgat 


ggtttcaacc 


tgcgtaacac 


caacctggct 


1200 


gctaatttta 


acggccagaa 


cacggaaatc 


aacaacatga 


acttcacaaa 


actgaaaaac 


1260 


ttcactggtc 


tgttcgagtt 


ttacaagctg 


ctgtgcgtcg 


acggcatcat 


tacctccaaa 


1320 


actaaatctc 


'tgatagaagg 


tagatacggt 


ggtttcctgg 


cgctagcggg 


cggtggcggt 


1380 


agcggcggtg 


gcggtagcgg 


cggtggcggt 


agcgcactag 


tgctgcagtg 


tatcaatctg 


1440 


gattgggacg 


taatccgtga 


taagaccaaa 


acaaaaatcg 


agtctttgaa 


agaacacggc 


1500 


ccgatcaaaa 


ataagatgtc 


tgaatcaccc 


aataaaactg 


tttcggagga 


aaaagcgaaa 


1560 


cagtatttgg 


aagagtttca 


tcaaaccgcg 


cttgaacatc 


cggagctcag 


tgaactgaaa 


1620 


acagtgacgg 


gaacgaatcc 


tgtttttgca 


ggcgcaaact 


atgcggcttg 


ggccgtgaat 


1680 


gttgcccaag 


taattgatag 


tgagaccgca 


gacaacctgg 


aaaagacgac 


cgcagcgtta 


1740 


agcattttac 


cggggattgg 


ttccgtgatg 


ggtatagcgg 


atggagcggt 


ccaccataac 


1800 


actgaggaaa 


ttgtcgccca -gtcaatcgct 


ctgagttccc 


tgatggttgc 


acaggctatc 


1860 


ccactcgtgg 


gggaactggt tgacataggt 


ttcgccgcct 


acaacttcgt 


agaaagcatt 


1920 


attaatcttt 


ttcaggtggt gcataacagc 


tacaaccgcc 


ctctagaatg 


a 


1971 



<210> 94 

<211> 656 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 94 



GclyoSer ;M^fr :l31.u FJie Val.Asn Lys. GjLn .Rhe^Asn .,Tyr Lys T^usp Jffjro Val 
15 10 15 
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Asn Gly Val Asp He Ala Tyr He Lys He Pro Asn Ala Gly Gin Met 
20 25 30 



Gin Pro Val Lys Ala Phe Lys He His Asn Lys He Trp Val He Pro 
35 40 45 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg He Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



He Val Arg Gly lie Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 ' 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 . 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
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Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe lie Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp lie Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



lie Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu lie Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys He Asn He Val Pro Lys Val 
370 375 380 



Asn Tyr Thr He Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly He He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 440 445 



Tyr Gly Gly Phe Leu Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly 
450 455 460 



Gly Ser Gly Gly Gly Gly Ser Ala Leu Val Leu Gin Cys He Asn Leu 
465 470 475 480 



Asp Trp Asp Val He Arg Asp Lys Thr Lys Thr Lys He Glu Ser Leu 
485 490 495 
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Lys Glu His Gly Pro He Lys Asn Lys Met Ser Glu Ser Pro Asn Lys 
500 505 510 



Thr Val Ser Glu Glu Lys Ala Lys Gin Tyr Leu Glu Glu Phe His Gin 
515 520 525 



Thr Ala Leu Glu His Pro Glu Leu Ser Glu Leu Lys Thr Val Thr Gly 
530 535 540 



Thr Asn Pro Val Phe Ala Gly Ala Asn Tyr Ala Ala Trp Ala Val Asn 
545 550 555 560 



Val Ala Gin Val He Asp Ser Glu Thr Ala Asp Asn Leu Glu Lys Thr 
565 570 575 



Thr Ala Ala Leu Ser He Leu Pro Gly He Gly Ser Val Met Gly He 
580 585 590 



Ala Asp Gly Ala Val His His Asn Thr Glu Glu He Val Ala Gin Ser 
595 600 605 



He Ala Leu Ser Ser Leu Met Val Ala Gin Ala He Pro Leu Val Gly 
610 615 620 



Glu' Leu Val Asp He Gly Phe Ala Ala Tyr Asn Phe Val Glu Ser He 
625 630 635 640 



He Asn Leu Phe Gin Val Val His Asn Ser Tyr Asn Arg Pro Leu Glu 
645 650 655 



<210> 95 
<211> 1329 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Synthetic 
<400> 95 

ggatccatgc ctattactat taacaatttt cgttatagcg atcccgtcaa caatgacacc 60 
attatcatga tggaaccgcc atattgcaaa ggactggaca tttactataa agccttcaag 120 
attactgacc gcatttggat tgttccagag cgttacgagt tcgggacgaa accagaagat 180 
tttaacccgc cttcatcgct gatcgaagga gcatcagagt attacgatcc gaactatctg 240 
cgtacggaca gcgataaaga ccgcttctta cagaccatgg tcaaactttt taaccgtatt 300 
aagaaeaatg* tggccggaga agcactcttg gataagatta tcaacgcgat tccatacctg 360 
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ggcaattctt 


acagcctgct 


ggataaattt gacacaaata gtaattcagt cagctttaac 


420 


ctgttagaac 


aagatccgag 


tggcgcaacc acgaagtctg ccatgctgac aaatctgatc 


480 


atttttggtc 


caggtcctgt 


actgaataaa aatgaagtac gcggcatcgt tctccgcgtg 


540 


gacaataaga 


actacttccc 


atgccgtgac ggcttcggtt cgatcatgca gatggctttc 


600 


tgtccggagt 


acgttccgac 


gtttgataat gttattgaga atatcacgag tttaacaatc 


660 


ggtaagtcaa 


aatattttca 


agatccggcc cttctcctta tgcatgaact gattcacgtg 


720 


ctgcacggct 


tatatggtat 


gcaagtgtcc tcgcatgaaa tcattccgtc caaacaggaa 


780 


atttatatgc 


agcataccta 


cccgatttca gctgaagagt tgtttacgtt tggtggccag 


840 


gacgcgaatt 


tgatctccat 


cgacatcaaa aacgatctgt atgagaaaac attaaatgac 


900 


tataaagcga 


ttgcgaacaa 


actgtctcag gtgactagct gcaacgatcc taacattgat 


960 


attgattcct 


acaaacaaat 


ttatcaacag aaataccagt tcgataaaga cagcaatggt 


1020 


cagtatatcg 


taaacgaaga 


taaatttcag atcctgtata acagcattat gtatggcttt 


1080 


accgaaattg 


agttggggaa 


gaaatttaac attaaaaccc gtctgtctta ttttagtatg 


1140 


aaccatgatc 


cggtgaaaat 


ccccaatctg cttgatgata ccatttataa tgataccgaa 


1200 


gggttcaaca 


ttgaatctaa 


ggatctgaaa tccgaataca aaggccaaaa tatgcgtgtt 


1260 


aatactaacg 


ctttccgtaa 


tgttgatggt agtggactcg tctcgaaact gattgggttg 


1320 


tgtgtcgac 






1329 



<210> 96 
<211> 2736 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 96 

ggatccatgc ctattactat taacaatttt cgttatagcg atcccgtcaa caatgacacc 60 
attatcatga tggaaccgcc atattgcaaa ggactggaca tttactataa agccttcaag 120 
attactgacc gcatttggat tgttccagag cgttacgagt tcgggacgaa accagaagat 180 
t-ttaacccgc cttcatcgct gatcgaagga gcatcagagt attacgatcc gaactatctg 240 
cgtacggaca gcgataaaga ccgcttctta cagaccatgg tcaaactttt taaccgtatt 300 
aagaacaatg tggccggaga agcactcttg gataagatta tcaacgcgat tccatacctg 360 
ggcaattctt acagcctgct ggataaattt gacacaaata gtaattcagt cagctttaac 420 
ctgttagaac aagatccgag tggcgcaacc acgaagtctg ccatgctgac aaatctgatc 480 
atttttggtc caggtcctgt actgaataaa aatgaagtac gcggcatcgt tctccgcgtg 540 
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actacttccc 


atgccgtgac 


ggcttcggtt 


cgafccatgca 


gacggccccc 


ouu 


t9tccg9agt 


acgttccgac 


gtttgacaau 


gccacugaga 


atatcacgag 


ttitiaacaabc 


bbU 


99taa9tcaa 


aatatcctca 


agatccggcc 




tgcatgaact 


gattcacgt^g 


•7 o n 


ctgcacggct 


tatatggtat 


gcaagtgtcc 


tcgcatgaaa 


tcattccgtc 


caaacaggaa 


•n Q r\ 
780 


atttatatgc 


agcataccta 


cccgatttca 


gctgaagagt 


tgtttacgtt 


tggtggccag 


840 


gacgcgaatt 


tgatctccat 


cgacatcaaa 


aacgatctgt 


atgagaaaac 


attaaatgac 


900 


tataaagcga 


ttgcgaacaa 


actgtctcag 


gtgactagct 


gcaacgatcc 


taacattgat 


960 


attgattcct 


acaaacaaat 


ttatcaacag 


aaataccagt 


tcgataaaga 


cagcaatggt 


1020 


cagtatatcg 


taaacgaaga 


taaatttcag 


atcctgtata 


acagcattat 


gtatggcttt 


1080 


accgaaattg 


agttggggaa 


gaaatttaac 


attaaaaccc 


gtctgtctta 


ttttagtatg 


1140 


aaccatgatc 


cggtgaaaat 


ccccaatctg 


cttgatgata 


ccatttataa 


tgataccgaa 


1200 


gggttcaaca 


ttgaatctaa 


ggatctgaaa 


tccgaataca 


aaggccaaaa 


tatgcgtgtt 


1260 


aatactaacg 


ctttccgtaa 


tgttgatggt 


agtggactcg 


tctcgaaact 


gattgggttg 


1320 


tgtgtcgacg 


gcatcattac 


ctccaaaact 


aaatctctga 


tagaaggtag 


atttggcggt 


1380 


ttcacgggcg 


cacgcaaatc 


agcgcgtaaa 


cgtaagaacc 


aggcgctagc 


gggcggtggc 


1440 


ggtagcggcg 


gtggcggtag 


cggcggtggc 


ggtagcgcac 


tagtgctgca 


gtgtatcaag 


1500 


gttaacaact 


gggatttatt 


cttcagcccg 


agtgaagaca 


acttcaccaa 


cgacctgaac 


1560 


aaaggtgaag 


aaatcacctc 


agatactaac 


atcgaagcag 


ccgaagaaaa 


catctcgctg 


1620 


gacctgatcc 


agcagtacta 


cctgaccttt 


aatttcgaca 


acgagccgga 


aaacatttct 


1680 


atcgaaaacc 


tgagctctga 


tatcatcggc 


cagctggaac 


tgatgccgaa 


catcgaacgt 


1740 


ttcccaaacg 


gtaaaaagta 


cgagctggac 


aaatatacca 


Xmm A— 1— . ^ A— — .. 

tgttccacta 


cctgcgcgcg 


1800 


caggaatttg 


aacacggcaa 


atcccgtatc 


gcactgacta 


actccgttaa 


cgaagctctg 


I860 


ctcaacccgt 


cccgtgtata 


caccttcttc 


tctagcgact 


acgtgaaaaa 


ggtcaacaaa 


1920 


gcgactgaag 


ctgcaatgtt 


cttgggttgg 


gttgaacagc 


ttgtttatga 


ttttaccgac 


1980 


gagacgtccg 


aagtatctac 


taccgacaaa 


attgcggata 


tcactatcat 


catcccgtac 


2040 


atcggtccgg 


ctctgaacat 


tggcaacatg 


ctgtacaaag 


acgacttcgt 


tggcgcactg 


2100 


atcttctccg 


gtgcggtgat 


cctgctggag 


ttcatcccgg 


aaatcgccat 


cccggtactg 


2160 


ggcacctttg 


ctctggtttc 


ttacattgca 


aacaaggttc 


tgactgtaca 


aaccatcgac 


2220 


aacgcgctga 


gcaaacgtaa 


cgaaaaatgg 


gatgaagttt 


acaaatatat 


cgtgaccaac 


2280 




aggttaata.c 


tcaga.t.cgac^ 


ctcatccgca 


aaaaaatgaa 


agaagcactn 


2340 
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gaaaaccagg 


cggaagctac 


caaggcaatc 


attaactacc agtacaacca 


gtacaccgag 


2400 


gaagaaaaaa 


acaacatcaa 


cttcaacatc 


gacgatctgt cctctaaact 


gaacgaatcc 


2460 


atcaacaaag 


ctatgatcaa 


catcaacaag 


ttcctgaacc agtgctctgt 


aagctatctg 


2520 


atgaactcca 


tgatcccgta 


cggtgttaaa 


cgtctggagg acttcgatgc 


gtctctgaaa 


2580 


gacgccctgc 


tgaaatacat 


ttacgacaac 


cgtggcactc tgatcggtca 


ggttgatcgt 


2640 


ctgaaggaca 


aagtgaacaa 


taccttatcg 


accgacatcc cttttcagct 


cagtaaatat 


2700 


gtcgataacc 


aacgcctttt 


gtccactcta 


gactag 




2736 



<210> 


97 


<211> 


911 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


97 



Gly Ser Met Pro He Thr He Asn Asn Phe Arg Tyr Ser Asp Pro Val 
15 10 15 



Asn Asn Asp Thr He He Met Met Glu Pro Pro Tyr Cys Lys Gly Leu 
20 25 30 



Asp He Tyr Tyr Lys Ala Phe Lys He Thr Asp Arg He Trp He Val 
35 40 45 



Pro Glu Arg Tyr Glu Phe Gly Thr Lys Pro Glu Asp Phe Asn Pro Pro 
50 55 60 



Ser Ser Leu He Glu Gly Ala Ser Glu Tyr Tyr Asp Pro Asn Tyr Leu 
65 70 75 80 



Arg Thr Asp Ser Asp Lys Asp Arg Phe Leu Gin Thr Met Val Lys Leu 
85 90 95 



Phe Asn Arg He Lys Asn Asn Val Ala Gly Glu Ala Leu Leu Asp Lys 
100 105 110 



He He Asn Ala He Pro Tyr Leu Gly Asn Ser Tyr Ser Leu Leu Asp 
115 120 125 



Lys Phe Asp Thr Asn Ser Asn Ser Val Ser Phe Asn Leu Leu Glu Gin 
130 135 140 
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Asp Pro Ser Gly Ala Thr Thr Lys Ser Ala Met Leu Thr Asn Leu lie 
145 150 155 160 



lie Phe Gly Pro Gly Pro Val Leu Asn Lys Asn Glu Val Arg Gly He 
165 170 175 



Val Leu Arg Val Asp Asn Lys Asn Tyr Phe Pro Cys Arg Asp Gly Phe 
180 185 190 



Gly Ser He Met Gin Met Ala Phe Cys Pro Glu Tyr Val Pro Thr Phe 
195 200 205 



Asp Asn Val He Glu Asn He Thr S.er Leu Thr He Gly Lys Ser Lys 
210 215 220 



Tyr Phe Gin Asp Pro Ala Leu Leu Leu Met His Glu Leu He His Val 
225 230 235 240 



Leu His Gly Leu Tyr Gly Met Gin Val Ser Ser His Glu He He Pro 
245 250 255 



Ser Lys Gin Glu He Tyr Met Gin His Thr Tyr Pro He Ser Ala Glu 
260 265 270 



Glu Leu Phe Thr Phe Gly Gly Gin Asp Ala Asn Leu He Ser He Asp 
275 280 285 



He Lys Asn Asp Leu Tyr Glu Lys Thr Leu Asn Asp Tyr Lys Ala He 
290 295 300 



Ala Asn Lys Leu Ser Gin Val Thr Ser Cys Asn Asp Pro Asn He Asp 
305 310 315 320 



He Asp Ser Tyr Lys Gin He Tyr Gin Gin Lys Tyr Gin Phe Asp Lys 
325 330 335 



Asp Ser Asn Gly Gin Tyr He Val Asn Glu Asp Lys Phe Gin He Leu 
340 345 350 



Tyr Asn Ser He Met Tyr Gly Phe Thr Glu He Glu Leu Gly Lys Lys 
355 360 365 - 



Phe Asn He Lys Thr Arg Leu Ser Tyr Phe Ser Met Asn His Asp Pro 
370 375 380 

Val Lys He Pro Asn Leu Leu Asp Asp Thr He Tyr Asn Asp Thr Glu 
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385 390 395 400 



Gly Phe Asn lie Glu Ser Lys Asp Leu Lys Ser Glu Tyr Lys Gly Gin 
405 410 415 



Asn Met Arg Val Asn Thr Asn Ala Phe Arg Asn Val Asp Gly Ser Gly 
420 425 430 



Leu Val Ser Lys Leu lie Gly Leu Cys Val Asp Gly lie He Thr Ser 
435 440 445 



Lys Thr Lys Ser Leu He Glu Gly Arg Phe Gly Gly Phe Thr Gly Ala 
450 455 460 



Arg Lys Ser Ala Arg Lys Arg Lys Asn Gin Ala Leu Ala Gly Gly Gly 
465 470 475 480 



Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu Val Leu 
485 490 495 



Gin Cys He Lys Val Asn Asn Trp Asp Leu Phe Phe Ser Pro Ser Glu 
500 505 510 



Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu He Thr Ser Asp 
515 520 525 



Thr Asn He Glu Ala Ala Glu Glu Asn He Ser Leu Asp Leu He Gin 
530 535 540 



Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro Glu Asn He Ser 
545 550 555 560 



He Glu Asn Leu Ser Ser Asp He He Gly Gin Leu Glu Leu Met Pro 
565 570 575 



Asn He Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu Leu Asp Lys Tyr 
580 585 590 



Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu His Gly Lys Ser 
595 600 605 



Arg He Ala Leu Thr Asn Ser Val Asn Glu Ala Leu Leu Asn Pro Ser 
610 615 620 



Arg Val Tyor- Thr Phe Pho Ser Ser Asp T^^' Val Lys Lys Val Asn Lys 
625 630 635 640 
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Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu Gin Leu Val Tyr 
645 650 655 



Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr Asp Lys lie Ala 
660 665 670 



Asp lie Thr He He He Pro Tyr He Gly Pro Ala Leu Asn He Gly 
675 680 685 



Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu He Phe Ser Gly 
690 695 700 



Ala Val He Leu Leu Glu Phe He Pro Glu He Ala He Pro Val Leu 
705 710 715 720 



Gly Thr Phe Ala Leu Val Ser Tyr He Ala Asn Lys Val Leu Thr Val 
725 730 735 



Gin Thr He Asp Asn Ala Leu Ser Lys Arg Asn Glu Lys Trp Asp Glu 
740 745 750 



Val Tyr Lys Tyr He Val Thr Asn Trp Leu Ala Lys Val Asn Thr Gin 
755 760 765 



He Asp Leu He Arg Lys Lys Met Lys Glu Ala Leu Glu Asn Gin Ala 
7.70 775 780 



Glu Ala Thr Lys Ala He He Asn Tyr Gin Tyr Asn Gin Tyr Thr Glu 
785 790 795 800 



Glu Glu Lys Asn Asn He Asn Phe Asn He Asp Asp Leu Ser Ser Lys 
805 810 815 



Leu Asn Glu Ser He Asn Lys Ala Met He Asn He Asn Lys Phe Leu 
820 825 830 



Asn Gin Cys Ser Val Ser Tyr Leu Met Asn Ser Met He Pro Tyr Gly 
835 840 845 



Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys Asp Ala Leu Leu 
850 855 860 



Lys Tyr He Tyr Asp Asn Arg Gly Thr Leu He Gly Gin Val Asp Arg 

86.6^ ...aii . c-..^ ■ >! ., ^.70vr '.',1 ./ *P75:'. 880 
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Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp lie Pro Phe Gin 
885 890 895 



Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu Leu Ser Thr Leu Asp 
900 905 910 



<210> 98 

<211> 180 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<400> 98 

ggatccacgc acgtcgacgc gattgatggt 


cgttttggcg 


gtttcacggg 


cgcacgcaaa 


60 


tcagcgcgta aacgtaagaa 


ccaggcgcta 


gcgggcggtg 


gcggtagcgg 


cggtggcggt 


120 


agcggcggtg gcggtagcgc 


actagtgctg 


cagacgcacg 


gtctagaatg 


ataaaagctt 


180 


<210> 99 
<211> 2715 
<212> DNA 

<213> Artificial Sequence 










<220> 

<223> Synthetic 












<400> 99 

ggatccgaat tcatgccgat 


caccatcaac 


aacttcaact 


acagcgatcc 


ggtggataac 


60 


aaaaacatcc tgtacctgga 


tacccatctg 


aataccctgg 


cgaacgaacc 


ggaaaaagcg 


120 


tttcgtatca ccggcaacat 


ttgggttatt 


ccggatcgtt 


ttagccgtaa 


cagcaacccg 


180 


aatctgaata aaccgccgcg 


tgttaccagc 


ccgaaaagcg 


gttattacga 


tccgaactat 


240 


ctgagcaccg atagcgataa 


agataccttc 


ctgaaagaaa 


tcatcaaact 


gttcaaacgc 


300 


atcaacagcc gtgaaattgg 


cgaagaactg 


atctatcgcc 


tgagcaccga 


tattccgttt 


360 


ccgggcaaca acaacacccc 


gatcaacacc 


tttgatttcg 


atgtggattt 


caacagcgtt 


420 


gatgttaaaa cccgccaggg 


taacaattgg 


gtgaaaaccg 


gcagcattaa 


cccgagcgtg 


480 


attattaccg gtccgcgcga 


aaacattatt 


gatccggaaa 


ccagcacctt 


taaactgacc 


540 


aacaacacct ttgcggcgca 


ggaaggtttt 


ggcgcgctga 


gcattattag 


cattagcccg 


600 


cgctttatgc tgacctatag 


caacgcgacc 


aacgatgttg 


gtgaaggccg 


tttcagcaaa 


660 


agcgaatttt gcatggaccc 


gatcctgatc 


ctgatgcatg 


aactgaacca 


tgcgatgcat 


720 


aacctgtatg gcatcgcgat 


tccgaacgat 


cagaccatta 


gcagcgtgac 


cagcaacatc 


780 


ttttacagcc agtacaacgt 


gaaactggaa 


tatgcggaaa 


tctatgcgtt 


tggcggtccg 


840 
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accattgatc 


tgattccgaa 


aagcgcgcgc 


aaatacttcg 


aagaaaaagc 


gctggattac 


o n n 


tatcgcagca 


ttgcgaaacg 


tctgaacagc 


attaccaccg 


cgaatccgag 


cagcttcaac 


c\ ^ r\ 
960 


aaatatatcg 


gcgaatataa 


acagaaactg 


atccgcaaat 


atcgctttgt 


ggtggaaagc 


1020 


agcggcgaag 


ttaccgttaa 


ccgcaataaa 


ttcgtggaac 


tgtacaacga 


actgacccag 


1080 


atcttcaccg 


aatttaacta 


tgcgaaaatc 


tataacgtgc 


agaaccgtaa 


aatctacctg 


1140 


agcaacgtgt 


ataccccggt 


gaccgcgaat 


attctggatg 


ataacgtgta 


cgatatccag 


1200 


aacggcttta 


acatcccgaa 


aagcaacctg 


aacgttctgt 


ttatgggcca 


gaacctgagc 


1260 


cgtaatccgg 


cgctgcgtaa 


agtgaacccg 


gaaaacatgc 


tgtacctgtt 


caccaaattt 


1320 


tgcgtcgacg 


cgattgatgg 


tcgttttggc 


ggtttcacgg 


gcgcacgcaa 


atcagcgcgt 


1380 


aaacgtaaga 


accaggcgct 


agcgggcggt 


ggcggtagcg 


gcggtggcgg 


tagcggcggt 


1440 


ggcggtagcg 


cactagtgct 


gcagtgtcgt 


gaactgctgg 


tgaaaaacac 


cgatctgccg 


1500 


tttattggcg 


atatcagcga 


tgtgaaaacc 


gatatcttcc 


tgcgcaaaga 


tatcaacgaa 


1560 


gaaaccgaag 


tgatctacta 


cccggataac 


gtgagcgttg 


atcaggtgat 


cctgagcaaa 


1620 


aacaccagcg 


aacatggtca 


gctggatctg 


ctgtatccga 


gcattgatag 


cgaaagcgaa 


1680 


attctgccgg 


gcgaaaacca 


ggtgttttac 


gataaccgta 


cccagaacgt 


ggattacctg 


1740 


aacagctatt 


actacctgga 


aagccagaaa 


ctgagcgata 


acgtggaaga 


ttttaccttt 


1800 


acccgcagca 


ttgaagaagc 


gctggataac 


agcgcgaaag 


tttacaccta 


ttttccgacc 


1860 


ctggcgaaca 


aagttaatgc 


gggtgttcag 


ggcggtctgt 


ttctgatgtg 


ggcgaacgat 


1920 


gtggtggaag 


atttcaccac 


caacatcctg 


cgtaaagata 


ccctggataa 


aatcagcgat 


1980 


gttagcgcga 


ttattccgta 


tattggtccg 


gcgctgaaca 


ttagcaatag 


cgtgcgtcgt 


2040 


ggcaatttta 


ccgaagcgtt 


tgcggttacc 


ggtgtgacca 


ttctgctgga 


agcgtttccg 


2100 


gaatttacca 


ttccggcgct 


gggtgcgttt 


gtgatctata 


gcaaagtgca 


ggaacgcaac 


2160 


gaaatcatca 


aaaccatcga 


taactgcctg 


gaacagcgta 


ttaaacgctg 


gaaagatagc 


2220 


tatgaatgga 


tgatgggcac 


ctggctgagc 


cgtattatca 


cccagttcaa 


caacatcagc 


2280 


taccagatgt 


acgatagcct 


gaactatcag 


gcgggtgcga 


ttaaagcgaa 


aatcgatctg 


2340 


gaatacaaaa 


aatacagcgg 


cagcgataaa 


gaaaacatca 


aaagccaggt 


tgaaaacctg 


2400 


aaaaacagcc 


tggatgtgaa 


aattagcgaa 


gcgatgaata 


acatcaacaa 


attcatccgc 


2460 


gaatgcagcg 


tgacctacct 


gttcaaaaac 


atgctgccga 


aagtgatcga 


tgaactgaac 


2520 


gaatttgatc 


gcaacaccaa 


agcgaaactg 


atcaacctga 


tcgatagcca 


caacattatt 


2580 


• r . . ctggtgggcg 


aagtggataa 


actgaaagcg 


aaagttaaca 


acagcttcca 


gaacaccat'^> 


2640 
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ccgtttaaca tcttcagcta taccaacaac agcctgctga aagatatcat caacgaatac 2700 
ttcaatctag actag 2715 



<210> 100 
<211> 904 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 100 

Gly Ser Glu Phe Met Pro lie Thr lie Asn Asn Phe Asn Tyr Ser Asp 
15 10 15 



Pro Val Asp Asn Lys Asn He Leu Tyr Leu Asp Thr His Leu Asn Thr 
20 25 30 



Leu Ala Asn Glu Pro Glu Lys Ala Phe Arg lie Thr Gly Asn He Trp 
35 40 45 



Val He Pro Asp Arg Phe Ser Arg Asn Ser Asn Pro Asn Leu Asn Lys 
50 55 60 



Pro Pro Arg Val Thr Ser Pro Lys Ser Gly Tyr Tyr Asp Pro Asn Tyr 
65 70 75 80 



Leu Ser Thr Asp Ser Asp Lys Asp Thr Phe Leu Lys Glu He He Lys 
85 90 95 



Leu Phe Lys Arg He Asn Ser Arg Glu He Gly Glu Glu Leu He Tyr 
100 105 110 



Arg Leu Ser Thr Asp He Pro Phe Pro Gly Asn Asn Asn Thr Pro He 
115 ■ 120 125 



Asn Thr Phe Asp Phe Asp Val Asp Phe Asn Ser Val Asp Val Lys Thr 
. 130 135 140 



Arg Gin Gly Asn Asn Trp Val Lys Thr Gly Ser He Asn Pro Ser Val 
145 150 155 160 



He He Thr Gly Pro Arg Glu Asn He He Asp Pro Glu Thr Ser Thr 
165 170 175 



Phe Lys Leu Thr Asn. Asn. Thr.^^Phgi^Ala ^a iGJ^. Glu,..Glj/ Ph^^aGlrjr Ala 
180 " ' ' ^ 185 190 
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Leu Ser He He Ser He Ser Pro Arg Phe Met Leu Thr Tyr Ser Asn 
195 200 205 



Ala Thr Asn Asp Val Gly Glu Gly Arg Phe Ser Lys Ser Glu Phe Cys 
210 215 220 



Met Asp Pro He Leu He Leu Met His Glu Leu Asn His Ala Met His 
225 230 235 240 



Asn Leu Tyr Gly He Ala He Pro Asn Asp Gin Thr He Ser Ser Val 
245 250 255 



Thr Ser Asn He Phe Tyr Ser Gin Tyr Asn Val Lys Leu Glu Tyr Ala 
260 265 270 



Glu He Tyr Ala Phe Gly Gly Pro Thr He Asp Leu He Pro Lys Ser 
275 280 285 



Ala Arg Lys Tyr Phe Glu Glu Lys Ala Leu Asp Tyr Tyr Arg Ser He 
290 295 300 



Ala Lys Arg Leu Asn Ser He Thr Thr Ala Asn Pro Ser Ser Phe Asn 
305 310 315 320 



Lys Tyr lie Gly Glu Tyr Lys Gin Lys Leu He Arg Lys Tyr Arg Phe 
325 330 335 



Val Val Glu Ser Ser Gly Glu Val Thr Val Asn Arg Asn Lys Phe Val 
340 345 350 



Glu Leu Tyr Asn Glu Leu Thr Gin He Phe Thr Glu Phe Asn Tyr Ala 
355 360 365 



Lys He Tyr Asn Val Gin Asn Arg Lys He Tyr Leu Ser Asn Val Tyr 
370 375 380 



Thr Pro Val Thr Ala Asn He Leu Asp Asp Asn Val Tyr Asp He Gin 
385 390 395 400 



Asn Gly Phe Asn He Pro Lys Ser Asn Leu Asn Val Leu Phe Met Gly 
405 410 415 



Gin Asn Leu Ser Arg Asn Pro Ala Leu Arg Lys Val Asn Pro Glu Asn 
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Met Leu Tyr Leu Phe Thr Lys Phe Cys Val Asp Ala lie Asp Gly Arg 
435 440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Arg Lys Asn 
450 455 460 



Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
465 470 475 480 



Gly Gly Ser Ala Leu Val Leu Gin Cys Arg Glu Leu Leu Val Lys Asn 
485 490 495 



Thr Asp Leu Pro Phe lie Gly Asp lie Ser Asp Val Lys Thr Asp lie 
500 505 510 



Phe Leu Arg Lys Asp lie Asn Glu Glu Thr Glu Val lie Tyr Tyr Pro 
515 520 525 



Asp Asn Val Ser Val Asp Gin Val He Leu Ser Lys Asn Thr Ser Glu 
530 535 540 



His Gly Gin Leu Asp Leu Leu Tyr Pro Ser He Asp Ser Glu Ser Glu 
545 550 555 560 



He Leu Pro Gly Glu Asn Gin Val Phe Tyr Asp Asn Arg Thr Gin Asn 
565 570 575 



Val Asp Tyr Leu Asn Ser Tyr Tyr Tyr Leu Glu Ser Gin Lys Leu Ser 
580 585 590 



Asp Asn Val Glu Asp Phe Thr Phe Thr Arg Ser lie Glu Glu Ala Leu 
595 600 605 



Asp Asn Ser Ala Lys Val Tyr Thr Tyr Phe Pro Thr Leu Ala Asn Lys 
610 615 620 



Val Asn Ala Gly Val Gin Gly Gly Leu Phe Leu Met Trp Ala Asn Asp 
625 630 635 640 



Val Val Glu Asp Phe Thr Thr Asn He Leu Arg Lys Asp Thr Leu Asp 
645 650 655 



Lys He Ser Asp Val Ser Ala He He Pro Tyr He Gly Pro Ala Leu 
660 665 670 
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Asn He Ser Asn Ser Val Arg Arg Gly Asn Phe Thr Glu Ala Phe Ala 
675 680 685 



Val Thr Gly Val Thr He Leu Leu Glu Ala Phe Pro Glu Phe Thr He 
690 695 700 



Pro Ala Leu Gly Ala Phe Val He Tyr Ser Lys Val Gin Glu Arg Asn 
705 710 715 720 



Glu He He Lys Thr He Asp Asn Cys Leu Glu Gin Arg He Lys Arg 
725 730 735 



Trp Lys Asp Ser Tyr Glu Trp Met Met Gly Thr Trp Leu Ser Arg He 
740 745 750 



He Thr Gin Phe Asn Asn He Ser Tyr Gin Met Tyr Asp Ser Leu Asn 
755 760 765 



Tyr Gin Ala Gly Ala He Lys Ala Lys He Asp Leu Glu Tyr Lys Lys 
770 775 780 



Tyr Ser Gly Ser Asp Lys Glu Asn He Lys Ser Gin Val Glu Asn Leu 
785 790 795 800 



Lys Asn Ser Leu Asp Val Lys He Ser Glu Ala Met Asn Asn He Asn 
805 810 815 



Lys Phe He Arg Glu Cys Ser Val Thr Tyr Leu Phe Lys Asn Met Leu 
820 825 830 



Pro Lys Val He Asp Glu Leu Asn Glu Phe Asp Arg Asn Thr Lys Ala 
835 840 845 



Lys Leu He Asn Leu He Asp Ser His Asn He He Leu Val Gly Glu 
850 855 860 



Val Asp Lys Leu Lys Ala Lys Val Asn Asn Ser Phe Gin Asn Thr He 
865 870 875 880 



Pro Phe Asn He Phe Ser Tyr Thr Asn Asn Ser Leu Leu Lys Asp He 
885 890 895 



He Asn Glu Tyr Phe Asn Leu Asp 
900 
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